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Introduction. When Darwin developed the theory of evolution by 
natural selection, practically nothing was known of hereditary differ- 
ences beyond their existence. Since 1900, a body of knowledge on the 
mechanism of heredity and on mutation has been built up by experi- 
ment that challenges any field in the biological sciences in the extent 
and precision of its results. The implications for evolution are not, 
however, immediately obvious. It is necessary to work out the statisti- 
cal consequences. 

Studies in the field of statistical genetics began shortly after the 
rediscovery of Mendelian heredity in 1900. Those of J. B. S. Haldane 
[7] and R. A. Fisher [4] have been especially important with respect 
to the application to evolution. My own approach to the subject came 
through experimental studies conducted in the U. S. Bureau of Ani- 
mal Industry on the effects of inbreeding, crossbreeding and selection 
on populations of guinea pigs [21, 22, 23, 37] and through the attempt 
to formulate principles applicable to livestock breeding [19, 20, 24, 
25, 13, 34]. On moving into the more academic atmosphere of the 
University of Chicago, I have become more directly concerned with 
the problem of evolution. 


I should note that the deductive approach, to which I shall con- 
fine myself here, involves many questions that can only be settled by 
observation and experimental work on natural populations and that 
a remarkable resurgence of interest in such work is in progress [2, 9]. 


Postulates. It will be desirable to begin with a brief review of the 
more important factors of which account must be taken. 

The basic fact of modern genetics is that heredity can be analyzed 
into separable units, “genes,” whose most essential property is that of 
duplicating themselves with extraordinary precision, irrespective of 
the characteristics of the organism in whose cells they are carried. We 
shall restrict consideration to changes in the system of genes and ag- 
gregates of genes (chromosomes). There are relatively rare and ob- 
scure hereditary changes which must be attributed to other cell 
components but our knowledge of these does not warrant the elabo- 
ration of a statistical theory. 

Fortunately the same theory applies to a large extent to gene muta- 
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tions and to most classes of grosser chromosomal changes (duplica- 
tions, deficiencies, inversions, translocations; and so on). It will be 
assumed here that a given kind of mutation occurs at a constant rate 
per generation. Observed rates in organisms as remote as corn plants, 
vinegar flies and man are of the order of 10- or less per generation. 
Reverse mutation may occur at measurable rates. 

It is simplest to deal with mere pairs of alternative conditions 
(alleles) but the theory remains seriously inadequate unless capable 
of extension to multiple alleles. 

In general I shall assume that the reproductive cells are haploid 
(that is, contain just one representative from each set of alleles) and 
that their union results in diploid individuals (with two such repre- 
sentatives in all cells, until reduction occurs in the formation of the 
germ cells). This is the usual case but there are species in which other 
situations prevail (tetraploids, hexaploids, aneuploids, and so on). 
The group of sex linked genes constitutes an important special case 
in many otherwise completely diploid organisms (including man). I 
shall not go far into the extension to these cases. 

It is simplest to assume that the members of different series of 
alleles are distributed at random in the reduction division by which 
the reproductive cells receive a half sample of the genes of the individ- 
uals producing them (that, for example, individual AaBd produces 
germ cells AB, Ab, aB and ab in equal numbers). The phenomenon 
of partial linkage, exhibited by genes carried in the same chromosome 
should, however, be taken into account. These are the principal postu- 
lates as far as the mechanism of heredity is concerned though others 
are required in special cases. 

The relations of genes to observed characteristics are important. 
In general, any measurable character is affected by genes at many loci 
and a single gene often has multiple apparently unrelated effects. The 
effects of genes in combinations are often roughly cumulative but 
marked exceptions are also very common. Account must be taken of 
noncumulative effects within series of alleles (dominance) and be- 
tween series (gene interaction). 

The breeding structure of the population is important. The situa- 
tion in nature is so complex that models must be chosen that are com- 
promises between mathematical simplicity and biological adequacy 
[35]. I shall introduce only the simplest models in the course of the 
present discussion. 

Natural selection is an exceedingly complex affair. Selection may 
occur at various biological levels—between members of the same 
brood, between individuals of the same local population, between 
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such populations (as through differential increase and migration) and 
finally between different species, a subject that carries us outside the 
field of genetics and which has been discussed mathematically by 
Lotka [12], Volterra [17] and Nicholson and Bailie [14]. Selection 
among individuals may relate to the mating activities of one or both 
sexes, to differences in rate of attainment of maturity, to differential 
fecundity and to differential mortality at all ages. Selection may act 
steadily or may vary both in intensity and direction in different re- 
gions and at different times. Again I can only deal here with the 
simplest models. 


Gene frequency. In such a complex situation, verbal discussion 
tends toward a championing of one or another factor. We need a 
means of considering all factors at once in a quantitative fashion. 
For this we need a common measure for such diverse factors as muta- 
tion, crossbreeding, natural selection and isolation. At first sight these 
seem to be incommensurables but if we fix attention on their effects 
on populations, rather than on their own natures, the situation is 
simplified. Such a measure may be found in the effects on gene fre- 
quency in each series of alleles. 

Because of the complete symmetry of the Mendelian mechanism, 
gene frequency has no tendency to change in an indefinitely large 
closed population not subject to mutation or selection. Each homo- 
zygote (for example, A1A1, A2A2 or A3A3) produces only one kind of 
germ cell. Each heterozygote (for example, A1A2, A1A3, A2A3) pro- 
duces two kinds in egual numbers. In a population in which the array 
of gene frequencies is (q:A1+ 9242+ --- + QmAm) (letting the g’s rep- 
resent the frequencies, and the A’s the genes) the frequencies of geno- 
types come to equilibrium according to the terms in the expansion 
of (q:A1+q2A2+ --- +GmAm)? in the first generation of random mat- 
ing after attainment of equality of gene frequencies in the sexes [8]. 
Under sex linkage [10, 15] and in polyploids [6] equilibrium is not 
reached at once but is rapidly approached. Inbreeding and assorta- 
tive mating change the relative frequencies of homozygotes and 
heterozygotes but not the gene frequencies. 

One immediate consequence of this persistence of gene frequencies 
is that variability tends to persist. But the slightest continuing un- 
balanced pressure on gene frequency tends to cause cumulative 
change. It is obvious that recurrent mutation, immigration, selection, 
and the accidents of sampling in an isolated population of small size 
are all factors that can bring about such change. 

The frequencies of combinations of different series of alleles (for 
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example, A, a; B, b) reach equilibrium in a random breeding popula- 
tion in which gene frequencies are constant only when the genes are 
combined at random (terms of [(1—ga)a+qaA |?[(1—gqs)b+q2B]?). 
Equilibrium is not reached immediately, however. The departure 
from equilibrium is halved in each generation of random mating in 
the case of two pairs of alleles in different chromosomes. In general, 
the departure is reduced by the proportion ¢, where ¢é is the mean 
chance of recombination [18, 11, 16]. 


Systematic changes of gene frequencies. The rate at which gene 
frequency changes under recurrent mutation is obvious [27]. Let g 
be the frequency of the gene and wu the rate at which it mutates to 
its alleles as a group and Ag the rate of change of g per generation 
Aq= —1q. 

If reverse mutation occurs at the rate v per generation, the net rate 
of change of g is 
(1) Aq = o(1 — q) — 4g. 

In the case of multiple alleles, v is the weighted average for the vari- 
ous alleles of the gene in question and is thus a function of their rela- 
tive frequencies. It is, however, independent of g. 

The effect of crossbreeding is similar if we adopt the simplest model 
[27]. If a population with gene frequency g exchanges the proportion 
m each generation with a random sample of immigrants from the 
whole species (gene frequency g;) the rate of change in gene frequency 
is 
(2) Aq = — m(q — q). 

In actual cases the immigrants are not likely to be a random sample 
from the whole species but to come largely from neighboring popula- 
tions. Effective m is thus, in general, smaller than the apparent 
amount of immigration and is not necessarily the same for all loci. 
There may also be selective migration. The simplest model must 
suffice here. It permits identification of the theories of mutation and 
immigration by substituting mq, for v, and m(1—q,) for u. 

The effects of selection have been considered extensively by Haldane 
[7] in terms of the frequency ratio (¢/1—g) and by Fisher [4]. As 
there can be no selection pressure without at least two alternatives, 
any expression for it, applicable to all values of g, must include the 
factor g(i—g), excluding certain limiting cases. Thus the form 
aq(i—gq) has been used by Fisher as the basis for general discus- 
sion. For the present purpose somewhat less general forms are more 
useful. Consider first the case of a random breeding population of 
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diploid individuals in which the combinations of paired alleles may 
be assigned constant relative selective values [30]. 


Genotype Frequency (f) Value (W) 
AA Waa 
AA’ 2q(1 —q) Waa‘ 
(1 —q)? 


For the frequency of A after a generation 


qi = [Waag? + Waa-g(i — 


where 
= Wasg? + — g) + Wara(1 — 
1—q) dW 
Aq = 
2W dq 


Selection, however, really applies to the organism as a whole not 
to single series of alleles. If the population is heterallelic in m pairs 
of pertinent alleles, the number of possible combinations is 3". Each 
of these has a certain frequency and a certain relative selective value, 
the latter of which we here assume to be constant. If the three phases 
in the A series of alleles are combined at random with the combina- 
tions of the other series, the average selective values of AA, AA’ 
and A’A’ are independent of g4 although functions of the other q’s. 
Thus 


qa(1 — qa) OW 
2W Oga 


We have assumed only pairs of alleles, but as any group of alleles 
may be treated formally as one, this formula may be applied to multi- 
ple allelic series. The selective values Was, Waa and Wa-4: are then 
functions of the relative frequencies within the group of alleles of the 
gene under consideration, but not of qa. 

In previous general discussions (for example [35, 36]) I have re- 
stricted myself to this convenient model of selection pressure. As this 
has given rise to misapprehension [5], it should be emphasized that 
it applies only under the conditions implied in its derivation. 

If there are selective differences between the sexes, as is very likely 
to be the case, there are departures from random combination within 
series of alleles. These are, however, unimportant for most purposes 
unless there is rather strong selection. 


(4) Aqa 
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Selection itself tends to bring about departures from random com- 
bination among different series of alleles. Again the effects are unim- 
portant in most cases, especially if all relative selective differences are 
slight. 

The formula must be written in a more generalized form to include 
polyploidy [32] and sex linkage. For small selective differences in a 
2k-ploid 


ga(1 — qa) aw 


5 A 
) 2kW aga 


This applies (approximately) under sex linkage if k=3/4 and 
W =(W,.W;)"? where W, and Wy; are the mean selective values in 
males and females, respectively. 

There may be departures from random mating because of a con- 
stant tendency toward mating of relatives, giving the following geno- 
typic frequencies within a series of alleles [23, 3]. 


Genotype _ Frequency 
AA (1—F)q?+ Fg 
AA’ 2(1— F)q(1—g) 
A'A’ (1—F)(1—g)?+F(1—q) 


Random combination between series of alleles is not disturbed ap- 
preciably if the selective differences are small or if the inbreeding 
coefficient F is small, giving the following formula in which Wz and 
W, are mean selective values of the random bred and inbred compo- 
nents of the frequencies relative to the A series [3] 


2W OgA Oga 


Inbreeding that leads to subdivision into partially isolated groups 
is best dealt with by a different mathematical model. 

Under assortative mating based on similarity in characteristics 
there are very great departures from random combination of different 
series of alleles [20]. Again we may best consider such a mating sys- 
tem as one leading to subdivision of the population into partially iso- 
lated groups. 

Returning to consideration of random breeding populations, it may 
easily be seen that if the W’s are functions of qa 


| 
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2W Oga 


Intra-brood selection is an example of a case in which OW/dq4 =0 
but gene frequency nevertheless changes. As in our model case, the 
process does not necessarily lead to fixation of the most favorable of 
the genotypes possible from the genes present in the population. 

Finally, if the genes under consideration affect the system of mat- 
ing itself, as is the case of the self-sterility alleles of many plants [33] 
or of genes that determine self fertilization [5], the changes in gene- 
frequency can only be found from the composition of the population 
in successive generations. 

We may note here that while, in principle, selection must be con- 
sidered to apply to the organism as a whole, one may analyze the 
organism into character complexes which evolve largely independ- 
ently through changes in largely independent systems of genes, the 
components of which are distributed at random among the chromo- 
somes. The case which we have chosen as a model of selection pres- 
sure (4) and its generalization (5) should apply sufficiently well to 
most reaction systems in freely interbreeding populations and gives 
an insight into certain aspects of the effects of selection which cannot 
be obtained as easily from the more complex special cases. 

If a character complex is affected by m loci and m; alleles at a par- 
ticular locus, it requires 


> (m: — 1) 


dimensions to represent the system of gene frequencies and 


Il [m:(m; + 1)/2] 


i=1 


kinds of genotypes are possible. Assuming that the relative selective 
values of these genotypes are independent of their frequencies, the 
mean selective values (W) of possible random breeding populations 
form a surface relative to this multi-dimensional system of gene fre- 
quencies, the gradient of which determines the way in which the pop- 
ulation tends to change under the influence of selection. The number 
of loci that may affect even the simplest characters are known in 
certain cases to be great and many, if not all such loci are probably 
represented at all times by multiple alleles. Thus the number 
[]?-1m; of homozygous types possible from genes actually present in 
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a species and affecting a particular character complex may often run 
into astronomical figures. Under these conditions it is to be expected 
that in general the surface W for any character complex will have 
numerous peaks, corresponding not only to different combinations of 
genes that give the same character [29] but also to different harmo- 
nious combinations of elementary characters that permit the organ- 
ism to overcome the same conditions in different ways. 

Mutation, immigration and selection may all be occurring simul- 
taneously. The net rate of change of gene frequency may be obtained 
by simply adding the contributions of these factors (1), (2), (4), if 
these are small. In our ideal case of a random breeding population of 
diploid individuals subject to reversible mutation, immigration and 
constant selective differences between genotypes [27, 30] 

oW 
(8) Ag = o(1 — g) — — — + 
There is equilibrium, stable or unstable, if Ag=0. With reversible 
mutation, there must be at least one gene frequency other than 0 or 1 
that is in stable equilibrium. There may also be stable equilibrium 
as a result of opposing selection pressures alone. 


Accidents of sampling. There is another possibility of change of 
gene frequency to be considered. In a population that is not indefi- 
nitely large, gene frequency may be expected to change from genera- 
tion to generation merely from the accidents of sampling. The 
composition of a population of N diploid individuals depends on 
that of 2N gametes produced by the preceding generation. If these 
are a random sample from the array [(1—g)A’+¢qA ], the probability 
array for values of g in the next generation is [(1—q)A’+qA ]?¥ with 
the standard deviation (q(1—gq)/2N)1/?. We will call a random devia- 
tion of g of this sort 6g in contrast with the systematic deviation Ag 
produced by mutation, migration or selection 


) 


It might seem that these random deviations would be negligible in 
any reasonably large population but in the absence of any systematic 
pressure toward equilibrium, the squared standard deviation for later 
generations increases approximately with the number of generations 
until there is an approach to the limiting value g(1—g) of complete 
fixation, [(1—q)A’A’+qAA ]. The exact value for the mth generation 
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is g(1—g)[1—(1—1/2N)*]. The rate of fixation of heterallelic genes 
approaches 1/2N per generation. 

The effective value of N should often be much smaller than its ap- 
parent value [27, 35]. It obviously applies only to individuals that 
reach maturity. If there is cyclic variation in population size, N is 
more closely related to the minimum than to the maximum number. 
It is also reduced if there is excessive variability in the number of 
mature offspring from different parents. 

In a 2k-ploid population [32], 

2 — q) 
(10) approximately. 
For sex linked genes it is approximately g(1—g)[2/9N;+1/9N | 
where JN; and N,, are the effective numbers of females and males, re- 
spectively, and thus is 2g(1—q)/3N if these are equal. 


The distribution of gene frequencies in the case of equilibrium. 
The tendency toward a stable equilibrium in the value of g, found 
when there are opposing systematic pressures, and the tendency to 
drift away from this point, due to the sampling variance, should re- 
sult in a probability distribution which one might expect to find real- 
ized by the values taken by the frequency of a particular gene over 
a long period of generations in the ideal case of a population in which 
all conditions remain constant. It would also be the distribution of 
values of g taken by this gene at a given time in an array of completely 
isolated populations, all of which are subject to the same conditions. 
Finally, all genes subject to systematic pressures of the same magni- 
tude should exhibit such a distribution at one time in a single popu- 
lation. While these are ideal cases, not likely to be approached in 
actual cases, it is of primary importance in the genetics of populations 
to be able to reach conclusions on the nature of such distributions. 

The distribution of gene frequencies in the case of equilibrium must 
satisfy the conditions of stability of the mean 


(q + Ag + 8g = 9) 
and stability of the variance (02449484) =9%). The possible values of g 
must range from 0 to 1. It is conyenient to use integration for sum- 
mation in expanding these expressions. Let ¢(g) be the ordinate of 
the required distribution. The formula of the distribution may be de- 
rived as follows [30, 31] 


f (q + Aq + = f 9¢(q)dq, 
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1 1 
0 0 


Since the mean value of 6g is 0 and since 6g is not correlated with 
(q+Agq), these reduce to the following, omitting a term involving 
(Ag)?, negligible if Ag is small 


1 
f Aq¢(q)dq = 0, 
0 
1 1 
2 (q — + f 


Let {Ag¢é(g)dq =x(q) and integrate the first term of the preceding 
equation by parts 


x(1) — x(0) = 0, 


1 1 
f x(qdq — [ax(0) + (1 — @)x(4)] — (1/2) f 
0 0 


It may be found by trial that both of these conditions are satisfied 
by the following equation if ¢(0) and ¢(1) are finite. Note that 
a3, =0 if g=0 or if g=1, since there can be no sampling variance un- 
less there are alternatives 


x(q) — x(1) = (1/2) 


dx(q) 2Aqdq 
d 1 1 = = 
og [x(q) — x(1)] 


Cc )d 
ix(@) — x()] = +e 
2f dda 


(11) $(q) = (C/os,)e 


where C is a constant such that [/¢(q)dq=1. 

The frequency of a particular value of q is approximately f(q) 
=(q)/2N. The amount of exchange between the subterminal and 
terminal classes is approximately half the frequency of the former 
from consideration of the Poisson distributions of the classes that are 
close to fixation [27]. 

For the model case in which Ag = 0(1 — g) — ug — m(q — qs) 
+(q(1—g)/2W)(@W/dq) and o3,=q(1—g)/2N equation (11) reduces 
to the following 
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f(A) = fl — 1/2N)/4N [m(1 — + 
\ 
Fic. 1 
\ i 
\ 


Fic. 3 Fic. 4 


Figures 1 to 3. Some of the forms taken by the distribution of frequencies of a 
completely recessive gene. Mutation rates are assumed equal in both directions (u =v). 
N=1/40v in Figure 1, 10/40v in Figure 2 and 100/40v in Figure 3. In each case the 
solid line represents the least selection (¢= —v/5), the long dashes represent selec- 
tion 10 times as severe (omitted in Figure 1 since practically indistinguishable 
from the preceding) and the short dashes represent selection 100 times as severe 

Figure 4. Frequency distribution of a semidominant gene in subgroups of a large 
population in which the varying conditions of selection among subgroups has lead toa 
mean gene frequency, g:=.25. The subgroups represented are assumed to be of the 
same size (N= 1000) and subject to the same selection pressure (Waa=1, Waa: =.9975, 
Waa =.995) but to different degrees of isolation (long dashes: m=.01, short dashes: 
m= .001, dots m= .0001), = — g)3000m—1, 


This brings the effects of reversible mutation, crossbreeding, selec- 
tion and size of population into a single formula. Figures 1 to 4 show 
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the forms taken by this distribution in certain special cases. The 
U-shaped distributions in small populations (Figure 1) may be com- 
pared with the J-shaped ones in large populations (Figure 3). The fig- 
ures bring out the relatively slight effects of selection in small popu- 
lations. 

The joint frequency distribution for multiple pairs of alleles may 
be written as follows for the model case that we have been consider- 
ing [30] 


(14) Qn) cw |] qi gi) + us 
it=1 

This applies to 2k-ploids if 4Nk is substituted for 4N in the expo- 
nents of g; and (1—g;). In the case of sex linkage and equal numbers 
of the sexes, 3.N is to be substituted for 4N in these exponents. As the 
exponent of W is not affected in these cases, the formula applies to 
joint distributions including different degrees of ploidy (aneuploids) 
and both autosomal and sex-linked genes. 

Figure 5 illustrates the frequencies along two diagonals of the joint 
distribution for two pairs of alleles which act cumulatively on the 


Fic. 5 


The frequencies along the diagonals of the joint distribution for two series of alleles 
with equal and additive effects on a character on which adverse selection acts accord- 
ing to the square of the deviation from the mean (W4ss=Waaps= Waocpa=1, 
= Wacw = Waav,=1—4s). The solid line shows 
the frequencies along the line connecting the two favorable types, AAbb and aaBB. 
The dashes show the frequencies along the line connecting the extreme types AABB 
and aabb. In the case shown, =, N=1/200, s=5va. 


same character of which the midgrade is optimum. There are two 
peak frequencies corresponding to approximate fixation of two differ- 
ent genotypes that give the midgrade of the character. In cases in- 
volving large numbers of genes there may be an indefinitely large 
number of peak frequencies. 


The distribution of gene frequencies under irreversible mutation. 
It is also important to determine the form taken by the distribution 
of gene frequencies when fixation of one of the alleles is an irreversible 
process. The distribution curve should reach constancy of form, but 
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all class frequencies (except that in which fixation is occurring) should 
fall off at a uniform rate K. The conditions may be expressed as fol- 
lows [31] 


q+ 4q+ =(1—K)g+K, 
(1 K)o, + K(1 a, 
K = (1/2)f(1 — 1/2N). 


It can be found by trial that with mutation at rate v, no selection, 
and o3,=q(1—g)/2N those conditions are satisfied by the following 
equation, with decay at rate K =» per generation [31] 


(15) = 


I have not been able to obtain a general solution comparable to 
equation (11) but formulae have been obtained for an important 
class of cases by another method [27, 30, 31]. Random breeding 
diploid populations with frequency array [(1—g)A’+qA] are dis- 
tributed in the following generation according to the expression 
[(1—qg—Ag)A’+(q+Agq)A Letting p=1-—g, the contribution to 
the frequency f(g.) of populations characterized by gene frequency g., 
is thus !] (pb —Ag)?% “f(q). The condi- 
tion that this frequency be reconstructed after a generation except 
for a reduction by the amount K can be represented sufficiently ac- 
curately as follows: 

If K =0, Ag =0, this equation is satisfied by ¢(g) = Cq-'+ D(1—g)-! 
for any values of C and D. For irreversible mutation at rate », it 
yields equation (15) and for migration and reversible mutation, but 
no selection, the same result as obtained from equation (12) [27, 31]. 

Selection pressure gives more difficulty. An important case that can 
be solved is that for very rare mutations (4Nv negligibly small) sub- 
ject to selection pressure of the fairly general form Ag = (s+ég)g(1—g). 
Here K may be taken as 0 


1 
2Nf(q.) = A — g(s + tq) + p(s + tq) 


where 
T(2N) 


(2N (2N Qe) 
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The following approximate relations may be used 
[(1 — g(s + = [1 — Np.g*(s + 
[(1 + p(s + tg) = — + 


Let - )/q(1—g) and use the 
approximate relation 


T(2N) 
f pe Pe—lg2Nae— 
qc) 0 


z z—1 z 
+ [ge ge | [x(x 1)/4N]. 
It may be found that C, = [(4N%s? + 2Nt)Cn-2 + 8N*stCn— 
+4Nt?C,,_4|/m(m+1) ignoring terms in which the exponent of N 
is less than the sum of those of s and ¢. After further reduction 


(16) f(g) = [Ceteat2N + 'y(2Nsq, 2Niq?)]/q(1 — 9) 
where C and D are any constants and (a, d) is as follows 


a” 


1 2a” 3a* 4a® 
7 2a 69a? 6a? 282a4 12a5 
5! 5! 7! 7! 9! 9! 
27 27a 348a? 204a? 


5s 321 132a 


2265 |+ 
he 


The special cases of most interest are those of irreversible mutation 
in one direction or the other and of equilibrium. Consider a popula- 
tion in which 2Ns and 2Nt have given values but N is indefinitely 
large and hence s and ¢ are indefinitely small. If mutation is occurring 
from the class g=0 at an exceedingly minute rate v with irreversible 
fixation in the class g=1, the frequency of the subterminal class, 
q=1/2N, must be approximately 4Nzf(0) while that of the other sub- 
terminal class (¢=1—1/2N) must be so much smaller as to be neg- 
ligible. The following are sufficiently accurate, letting f(0) =1 


= 
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D 
f(1/2N) = 2N = 
f(1 — 1/2N) = 4+ 2Nt)] = 0. 


As ¥(2Ns, 2Nt) is of the order of e?%*+¥!, D/2N is negligible com- 
pared with C 
C = 20, 
Qve2N stN t 


W2Ns, 


¥(2Ns, 21) 


(17) fq) = 


q(1 — q) 


In the subterminal regions selection is practically inoperative, 
Ag=s/2N, approximately for g=1/2N; and Ag=(s+#)/2N for 
g=(1—1/2N). The formula of the curve in the neighborhood of 
q=1/2N is therefore approximately 2v/g. Thus with given v but dif- 
ferent values of N, f(1/2N)(=4Nv) always falls very nearly on the 
same smooth curve. The relation between the selection pressure and 
the sampling variance (which measures the additive effect of sam- 
pling) is constant for given values of 2Ns, 2Né and gq, (Ag/o%,) 
=(2Ns+2Niq). Thus the smoothed probability curve, f(q), should 
be the same throughout with given 2Ns and 2M¢t irrespective of the 
values of N and of s, ¢ separately. Thus equation (17) is the general 
formula for the case of irreversible mutation from the class at g=0. 

The determination of ¥(a, b) in specific cases is a rather formidable 
task since it involves two variables and converges slowly. For the 
case of semidominance however, t=0 and ¥(2Nsq, 0) reduces to 
89) /4Nsq 


20 (1 — 


This agrees with a result obtained by Fisher [4] by a different 
method, involving a transformation of scale (@=cos—! (1—2q)) de- 
signed to make sampling variance uniform, and expression of the 
conditions in the form of a differential equation. The chance of fixa- 
tion of a mutation in this case is given by the ratio of the subtermi- 
nal classes (f(1—1/2N)/f(1/2N)) and is thus 2s/(1—e~-*%*) where s 
is the selection favoring the heterozygote. This is practically constant 
(2s) in large populations. There is a small chance of fixation of even 
unfavorable mutations (2s/(e*"* — 1). Figure 6 illustrates the distribu- 
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20 
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tion curve for various intensities of selection (but with the selection 
favoring heterozygotes represented by (1/2)s instead of s). 


| 
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Figures 6 to 9. Distributions of gene frequencies under irreversible mutation at 
indefinitely low rates (v). Six cases are shown in each figure. Dash and two dots, 
s = —4/2N; dash and dot, s= —1/2N; solid line, s=0 (f(g) =2v/q in all figures); long 
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In the case of recessive mutations, s = 0 and Ag =iq?(1—q). The law 
of the series for ¥(0, b) is that En =(Em_1+Em_2)/2m(2m—1) where 
E,, is the coefficient of b”. The distribution curve for a number of 
values of 2Ni are shown in Figure 7 (but with s instead of ¢ as the 
selection coefficient). The chance of fixation was determined empiri- 
cally from the ratio of subterminal frequencies up to 2Nt=12. For 
larger values (up to 2Nt=64) it was more convenient to calculate it 
from the ratio of the flux (2 Ag f(g)) in the region of maximum Ag, 
(q=2/3) to the proportion of mutations (2Nz) (usable only for large 
Aq/s;,). For values of ¢ ranging from 4/2N to 64/2N the average 
chance of fixation came out 1.1(¢/2N)/?, apparently approaching 
(t/2.N)*/? and thus a function of N even in large populations contrary 
to the case of semidominance. 

In the case of dominant mutations, Ag = (s —sq)q(1—g) if mutations 
are taken as occurring from the class g=0. It is more convenient to 
assume that they are occurring from the class g=1 (Ag = —sq?(1—g)) 
since this merely requires evaluation of ¥(0, —2Nsqg?) instead 
of the two-dimensional series ¥(2Nsqg, —2Nsg?). From consid- 
erations analogous to those discussed above, it appears that for 
Ag =(s+iq)q(1—g) but irreversible mutation from the class at g=1, 
C=0, */(2Ns, 2Nt) to a sufficient approximation, 


V(2Ns, 


dashes, s=1/2N; short dashes, s=4/2N; dots, s=16/2N. The ordinate at g=1/2N 
is the same for all curves (f(q)=4Nv) and far above the range of the figures (except 
in the case s=16/2N, Figure 9, in which all ordinates are greatly reduced). 

Figure 6. The case of a semidominant mutation A’ (Waa=1, Waa’ =1+/2, 
Waa’ =1+5). The probabilities (P) of fixation of a mutation are as follows: For 
s=—4/2N, P=0.075/2N; for s=—1/2N, P=0.58/2N; for s=0, P=1/2N; for 
s=1/2N, P=1.6/2N; for s=4/2N, P=4.1/2N; for s=16/2N, P=16/2N; for large s, 
P=s. 

Figure 7. The case of a recessive mutation a, (Was=Wac=1, Wea=1+5). For 
s=—4/2N, P=0.12/2N; for s=—1/2N, P=0.70/2N; for s=0, P=1/2N; for 
s=1/2N, P=1.3/2N; for s=4/2N, P=2.3/2N; for s=16/2N, P=4.3/2N; for large s, 
P=(s/2N)"2. 

Figure 8. The case of a dominant mutation A (Wae=1, Wasa=Waa=1+5). 
For s=—4/2N, P=0.042/2N; for s=—1/2N, P=0.49/2N; for s=0, P=1/2N; 
for s=1/2N, P=1.9/2N; for s=4/2N, P=6.6/2N; for s=16/2N, P=31/2N; for 
large s, P=2s. 

Figure 9. Thecase of a mutation A’, selected only in heterozygotes (Waa = Wa'a’ =1, 
Waa =1+s). For s=—1/4N, P=0.23/2N; for s=—1/2N, P=0.71/2N; for s=0, 
P=1/2N; for s=1/2N, P=1.4/2N; for s=4/2N, P=3.2/2N; for large s, P ap- 
proaches 0 and all loci tend to become heterallelic. 
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Figure 8 shows the form taken by the distribution in special cases 
(with s as the selection favoring the dominant mutation and mutation 
taken as occurring from the class g=0). The chance of fixation of 
favorable dominant mutation with large N is approximately 2s which 
is the same as for favorable semidominant mutations provided that s 
is the selection favoring the heterozygotes in both cases. This is to 
be expected since homozygotes are relatively rare until mutation has 
passed through the point of maximum selection pressure. 

Figure 9 shows the distribution curve in the case in which there is 
no selective difference between the homozygotes but selection favors 
or opposes the heterozygotes. In thiscasein which Ag = s(1—2g)q(1—g), 
formula (17) was used for values of 2s up to 4. For large values of 
2Ns, there is so little fixation that the distribution under equilibrium 
may be used. 

By combining the formulae for irreversible mutation in each direc- 
tion in such a ratio that the amounts of fixation are equal, we obtain 
a distribution identical with that of reversible mutation occurring at 
rates at which 4Nv and 4Nu are negligible. The result agrees with the 
limiting value obtained from equation (12) 


c 2 
q(1 — q) 

The evolutionary process. I can go only briefly into the implica- 
tions for evolution. We must distinguish two processes (a) the trans- 
formation of a single population until it has become so different that 
a new species or higher category must be recognized and (b) the cleav- 
age of species. 

Consider first the possibilities of transformation in a very large, 
closed, freely interbreeding population, living under conditions that 
are the same on the average for thousands of generations [28]. In such 
a population, random changes in gene frequency are negligible. Gene 
frequencies can change only according to the systematic pressures of 
mutation and selection, a process which must stop when all Ag’s be- 
come zero, unless there is a flow of untried mutations that are favor- 
able from the first. We have a theory of the stability of species in spite 
of variability due to continually occurring mutations and in spite of 
continuous action of selection. 

A stable state of this sort may be far from being the most adaptive 
of the systems possible from the genes actually present. Consider here 
the situation with respect to characters to which our model selection 
pressure applies. As noted, many distinct peak values are to be ex- 


pected in the surface of selective values, W, relative to the multi- 
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dimensional system of gene frequencies. In a species located at a 
particular point in the system, each gene frequency will change un- 
til all Ag’s are zero. These changes will be such that the mean selec- 
tive value of the populations changes approximately by the amount 
AW= 2(Agd W /dq), the species moving up the steepest gradient in the 
surface W except as affected by mutation pressures [29]. It stops 
when AW =0, a point in the neighborhood of the peak toward which 
selection has been directed, but not at the highest point because of 
the mutation terms in the Ag’s. In general there will be other peaks 
on the surface W that are higher but the species cannot reach them. 

Perhaps, however, we have been too hasty in assuming that all Ag’s 
would ever reach zero simultaneously. It is probable that the poten- 
tial alleles at each locus form an indefinitely extended series in which 
any one allele can give rise to certain others, these to ones at two re- 
moves from the first, and so on. A continual flow of untried favorable 
mutations may keep the population in a state of flux. In general, how- 
ever, it would seem probable that the rate of the movement toward 
the equilibrium point indicated by the existing genes would be of a 
higher order of magnitude than the rate of elevation of this peak by 
the occurrence of mutations of this very unusual sort. 

In a population in approximate equilibrium, the variability due to 
the balance between mutation and selection is not likely to be great. 
If, for example, Ag=sq(1—g) —uq, g=1—u/s at equilibrium. As mu- 
tation rates are typically of the order 10~ or less, g is close to 1 if the 
gene in question has an appreciable advantage. Loci in which there 
are opposing selection pressures would contribute more to variability. 
This may occur where a heterozygote combines favorable effects of 
two genes (Ag of form g(1—g) [s2—(s1+52)q] with stable equilibrium 
at g=52/(s1+52) if both s; and s2 are positive). It may also occur where 
different homozygotes have advantages in different ecological niches 
occupied by the species in such a way that the net selective values are 
related inversely to the frequencies (a case to which our model applies 
only approximately). With only one or the same few alleles main- 
tained at high frequencies by the population the chance for the oc- 
currence of fixation of untried favorable mutations at several removes 
from those present is small. The extreme improbability of such muta- 
tions justifies use of the formulae for irreversible mutation at very 
low rates (4Nv much less than 1) in spite of the fact that we are con- 
sidering large populations (17), (19) (Figures 6 to 9). There is the 
possibility of an indefinitely continuing evolutionary process but the 
rate is restricted by the low probability of the necessary mutations 
and the incomplete utilization of the potentialities for adaptation pro- 
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vided by the genes actually present (cf. however Fisher [4] and cri- 
tique [26]}). 

Conditions are, however, continually changing. Selection of in- 
creased severity but unchanged direction merely carries the location 
of the system of gene frequencies closer to the peak and increases 
somewhat the chance for a novel favorable mutation to reach fixation. 
With secular changes in the direction of selection, on the other hand, 
peaks in the surface W may become depressed and low places ele- 
vated. In species which are sufficiently labile to avoid extinction, the 
system of gene frequencies is kept continually on the move. It is 
likely to be shuffled into regions of W that are in general the higher 
ones. This process is undoubtedly of great importance for evolution- 
ary change. 

In sufficiently small populations, the random divergences of gene 
frequencies from their equilibrium values become important. In very 
small populations, these tend to bring about approximate fixation of 
some random (and therefore, in general, non-adaptive) combination 
of genes (Figure 1). Moreover, while selection pressure is less effective 
in small populations than in large ones, mutation pressure remains 
the same. Random mutations are more likely to be degenerative than 
adaptive. Long continued reduction in the size of a population is thus 
likely to lead to extinction. On the other hand, the less extreme 
random variations found in populations of intermediate size (4Nv or 
4Ns of the order 1 to 10) (cf. Figures 2, 3, 5) act somewhat like 
changes in the direction of the selection. The system of gene fre- 
quencies is kept continually on the move and this gives a trial and 
error process which at times may lead to adaptive combinations which 
would not have been reached by direct selection. The rate of change 
of this sort is slow under the required conditions. 

Consider next a large population that is divided into many small 
partially isolated races. These may differ in size and degree of isola- 
tion and in the direction and severity of the selection to which they 
are subjected. The conditions are present for an extensive testing by 
trial and error of a relatively large number of alleles at each locus and 
of different combinations of these. 

Local differences in direction of selection are effective if the selec- 
tion coefficient s (writing selection pressure in the form Ag=sq(1—gq)) 
exceeds the crossbreeding coefficient m (cf. Figure 4). While these 
differences are primarily of merely local significance, there is the pos- 
sibility of acquirement of an adaptation that turns out to be of gen- 
eral value which then may spread throughout the species. Moreover, 
if different alleles come to be characteristic of different races, the store 
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of variability of the species as a whole is increased. This is also true 
of each local race as a consequence of crossbreeding. 

In races in which 4Nm is of the order 1 to 10 there is considerable 
random differentiation without the approach to fixation in equally 
small completely isolated groups. Such changes may occur with con- 
siderable rapidity in this case and while non-adaptive are not neces- 
sarily anti-adaptive to an appreciable extent. It is merely that the 
location of the system of gene frequencies on the surface of selective 
values in our model case is not constrained to move up the steepest 
gradient but may move up gradients that are not the steepest and 
occasionally even down hill. Among the many local races exploring 
the neighborhood of a peak in the surface of selective values, one or 
more may reach a gradient leading to a higher peak (cf. Figure 10). 


——— 
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\ 


Fic. 10 


The frequencies along the diagonal (0, 0) to (1, 1) of the joint distribution of fre- 
quencies of recessive genes, a and b in subgroups of the same size (V=1000) but 
different degrees of isolation (long dashes: m=.01; short dashes: m=.001; dots: 
m= .0001) in a large population in which the conditions are everywhere the same 
(each recessive with selective disadvantage of .001 relative to the type A —B— but the 
double recessive with an advantage .01 over type). With mutation rates u.=%= ts 
=v,= 10-5 there would be two positions of stable equilibrium, one at about (0.1, 0.1) 
which is assumed to hold for the major portion of the species for historical reasons, 
and one at about (.999, .999) which is more adaptive 


(go, 6) = C[1 — .002(ga + ge) + — — 


Local populations for which m=.01 vary only slightly from the frequencies character- 
istic of the species. Most of those for which m=.001 show a close approach to fixation 
of the type genes (AABB) but occasionally there is an approach to fixation of the 
more adaptive double recessive. Those with m=.0001 are largely homallelic in aabb. 
In the long run such superior subgroups might be expected to pull the whole species 
to this position by intergroup selection. The scales are not the same in this figure. 
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By expansion of numbers and excess migration such races tend to 
bring the species as a whole under control of this peak. Intergroup 
selection of this sort, with respect to racial differentiation that has 
jointly adaptive and non-adaptive aspects, seems to provide the most 
effective mechanism for testing many alleles at each locus and many 
combinations of these and is thus the most effective mechanism for a 
continuing evolutionary process [26, 27, 28, 29, 35, 36]. 

It should be emphasized that we are not concerned here with local 
races as incipient species. As long as isolation is incomplete the races 
are bound together by crossbreeding and thus are carried along by the 
evolution of the species as a whole although subject to the minor 
kaleidoscopic changes in character which according to this theory 
play a major role in the evolution of the whole. 

The cleavage of species depends on virtually complete isolation of 
portions of the species from each other. Even if there are no signifi- 
cant character differences at the time of separation and even if con- 
ditions remain substantially the same for the two portions, the process 
described above will insure that they drift apart. Each continues to 
be adapted to the conditions but in somewhat different ways. They 
may be expected to move to increasingly more remote peaks on the 
surface W. In the course of time genic and chromosomal differences 
may be expected to accumulate that prevent crossing and so clinch 
the specific distinction. Before this point is reached, the occasional 
occurrence of hybridization may transfer blocks of genes from one 
species to the other or lead to the origin of a completely hybrid spe- 
cies, presenting a mechanism of reticulate evolution, analogous to 
that described above but on a coarser scale (cf. [1]). 

Under certain conditions the multiplication and diversification of 
species may be a very rapid process. These include a relaxation of the 
general selection pressure on the species permitting great increase in 
numbers and great variability; the opportunity for the occupation of 
widely distinct ecological niches associated with almost complete iso- 
lation of the groups seizing these opportunities and with subdivision 
of these groups into partially isolated local populations. A species that 
is the first of its general kind to reach unoccupied territory finds most 
at least of these conditions realized. This is also the case with a species 
that by any means acquires an adaptation of first rate general sig- 
nificance which gives its subgroups an advantage over species already 
established in various ecological niches, that more than compensates 
for the initial lack of special adaptations for these niches. 

The most general conclusion that can be drawn from the attempt to 
develop a mathematical theory of the simultaneous effects of all sta- 
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tistical processes that affect the genetic composition of populations 
is that in general the most favorable conditions for evolutionary ad- 
vance are found when these are balanced against each other in certain 
ways, rather than where any one completely dominates the situation. 
Finally it may be said that the more detailed knowledge of heredity 
and mutation that is now available confirms Darwin’s general con- 
tention that evolution is a process of statistical transformation of 
populations. 
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MATHEMATICAL PROBLEMS CONNECTED WITH THE 
BENDING AND BUCKLING OF ELASTIC PLATES 
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1. Introduction. As part of his Gibbs Lecture of last year Theodore 
von K4rm4n [5] considered briefly a number of nonlinear problems 
concerning thin elastic plates. This address will include discussion of 
linear as well as of nonlinear problems concerning elastic plates, with 
special stress on a particular aspect of the nonlinear problems. How- 
ever, it is not my intention to give a more or less complete survey of 
the literature on the subject; my purpose is rather to present in some 
detail a few specific leading ideas and points of view and to call atten- 
tion to a number of unsolved problems. 

The mathematical problems connected with thin elastic plates be- 
long with the many problems from mechanics which have interested 
eminent mathematicians and from which mathematical ideas of fun- 
damental importance have originated. These problems are of at least 
equal interest in engineering. In fact, a considerable part of the recent 
interest in the subject of thin plates can be attributed to the need 
for numerical solutions of problems which arise in the design of the 
thin-walled structures used in aircraft. The latter problems are in the 
main nonlinear boundary value problems of an involved type about 
which almost nothing of a general nature is known, and, aside from 
the technical difficulties encountered in attempting to obtain numeri- 
cal solutions, such problems present questions of a’ purely mathe- 
matical nature which require elucidation. The latter part of this 
address is largely concerned with the mathematical problems which 
arise in this connection. 

The first attempts to formulate mathematically the problem of 
flexure of thin elastic plates were probably inspired by the experi- 
mental researches of Chladni in 1787 on the modes of vibration of 
thin elastic plates. During the next twenty years a number of un- 
successful attacks were made upon the problem by James Bernoulli 
the Younger and others. In 1809 the French Academy offered a prize 
for a theory of the vibrating plate, which, after some controversy, 
was awarded to Sophie Germain in 1815. Lagrange, who was a mem- 
ber of the prize committee, corrected the theory of Sophie Germain 
and derived the partial differential equation as we know it now. In 
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1822 Cauchy, prompted by his membership on a committee to con- 
sider a paper of Navier on plates, derived for the first time the gen- 
eral linear theory of elasticity substantially in its present form. It is 
noteworthy that a relatively complicated special problem in elasticity 
had been formulated, in the main correctly, before the general linear 
theory of elasticity was developed. One is tempted to enlarge upon 
the fact that mathematicians of those days felt no distinction between 
pure and applied mathematics and often found inspiration in quite 
special problems taken from mechanics. 


2. The linear bending problem. The next outstanding historical 
event in the theory of plates occurred in 1850 with the publication 
of a famous paper by Kirchoff. In order to explain the significance of 
the work of Kirchoff it is necessary to formulate briefly the problem 


< middle surtace 


bz 


Fie. 1 


of the bending of a uniform thin plate subjected to forces perpendicu- 
lar to the plane of the plate. Assuming that the thickness h of the 
plate is small in comparison with its other dimensions and that the 
deflection under load is small in comparison with h, it is shown that 
all stresses in the plate can be calculated in terms of the deflection 
w(x, y) of the “middle surface” of the plate (Figure 1). The function w 
is shown to satisfy the linear differential equation: 


a 
(1) NAAw = F(x, y), A=—+—>) 


in which F is the force per unit area applied at right angles to the 
plane of the plate and N is a constant depending only upon the thick- 


er. 
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ness h and the material of the plate. Once a solution w(x, y) of (1) 
has been obtained the most important quantities from the practical 
point of view, that is, the corresponding stresses, can be calculated. 
More specifically, the normal stress ¢, and the shear stresses 7, and 
Tz: (on an element perpendicular to the x-axis), for example, are de- 
termined from w through the formulas 


(2a) r= (= +y 
(2b) (=) 
(2c) Tz3 = a3 — (Aw), 
Ox 


where a1, a2, a3 depend only upon the thickness of the plate and upon 
the distance z from the middle surface to the point where the stresses 
are to be found. The essential point here is that the a; are independent 
of x, y, and w. The quantity v in (2a) is a material constant called 
Poisson’s ratio. 

To solve the problem posed by Chladni’s experiments would require 
the solution of the linear eigenvalue problem associated with (1) (that 
is, that belonging with the biharmonic equation) under the homo- 
geneous boundary conditions resulting from the assumption that no 
forces are applied at the edges of the plate. This would mean, one 
would suppose, that all three of the expressions on the right side of (2) 
should vanish at an edge perpendicular to the x-axis. But the differ- 
ential equation is of order four, and consequently only two independ- 
ent boundary conditions can be imposed. This puzzling fact had not 
escaped the notice of the early workers in the field, but it remained 
for Kirchoff [9] to settle the question. Kirchoff started from the usual 
assumptions of the theory, deduced the expression for the potential 
of the deflected plate, and found by the calculus of variations not 
only the differential equation (1) but also the correct boundary con- 
ditions. At a free edge he found that condition (2a) equated to zero 
continues to hold but that conditions (2b) and (2c) must be replaced 
by the single relation (for an element of the boundary perpendicular 
to the x-axis) 

(2d) = —+ (2 —») =0 
x Oxdy 


This was probably the first striking illustration of the power inherent 
in the calculus of variations to furnish the correct and appropriate 
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boundary conditions for a differential equation. Kirchoff had found 
what are now called the natural boundary conditions. The idea of 
natural boundary conditions, when raised to the level of a general con- 
cept, has proved to be a fruitful one which has been extensively ex- 
ploited, for example, in the work of R. Courant and his pupils. 

Finally, Kelvin and Tait [8] showed in 1876 how the result of 
Kirchoff could be reconciled physically with the mechanics of the 
situation. They observed that conditions (2a, b, c) could all be pre- 
scribed for a thick plate and that plausible physical arguments could 
be advanced indicating that conditions (2b) and (2c) coalesce into 
(2d) when the thickness of the plate is made small. This aspect of the 
problem has never been given a rigorous mathematical formulation. 
It is in fact an example of a “boundary layer” problem—a type of 
problem to be discussed later in another connection. 

A vast amount of work has been done in solving the special linear 
boundary value problems arising from (1) for plates of various shapes 
and for various prescribed loading functions F(x, y) and conditions 
at the boundaries. Many of the cases arising in practice have never 
been solved in closed form or by series, and resort has been taken to 
various methods of approximation. One of the most famous of these 
is the method of Ritz [14]. It is interesting to note that Ritz himself 
applied his method first to a number of plate problems. Among the 
problems solved by Ritz is included the first and only solution of the 
Chladni problem, which had remained unsolved for more than a cen- 
tury. The work of Ritz represents one of the early successes of what 
are now called direct methods of the calculus of variations. However, 
the method of finite differences has perhaps proved even more service- 
able in obtaining approximate numerical solutions of the linear bend- 
ing problems [cf. 10]. 

Although the linear bending problem can be considered as solved 
in its essentials from the point of view of the mathematician, since 
the relevant existence and uniqueness theorems are available, there 
remain nevertheless some unsolved special problems of mathematical 
interest. For example, there is the curious problem of the rectangular 
plate resting along its edges on rigid supports and subjected to a con- 
stant downward distributed pressure F. It is easily shown that a 
downward concentrated force is necessary at the corners of the plate 
in order to maintain the edges in contact with the supports; if such 
forces are not provided, experiments show that the corners will lift 
up, the plate remaining in contact with the supports only in the cen- 
tral portions of the sides. This presents a boundary value problem of 
unusual type: The boundary conditions are not given a priori in the 
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form of equalities, since the position of the points at which the edge 
of the plate rises from the supports is not known in advance. The con- 
dition ¢,=0 holds everywhere on a boundary line x =const., but there 
are two alternatives for the other boundary condition, that is, either 
w=0 with g. £0 or w<0 with g,=0. In spite of this, the problem prob- 
ably has a unique solution. 


3. The von Karm4n equations. The linear theory of bending de- 
scribed above was sufficient for most practical purposes until rather 
recently. With increased use of very thin metal plates, particularly 
in the structures used in aircraft, it has become of practical impor- 
tance to obtain solutions based on a theory which would permit 


4 


x} 


> —+ 
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deflections much greater than the thickness of the plate. The experi- 
ments indicate that the linear theory fails to mirror the physical facts 
if deflections as little as one-fourth of the thickness of the plate occur. 
We owe to von K4rmén [6] an extension of the theory in which the 
squares of the slopes of the deflected middle surface are not neglected, 
as is the case with the linear theory. At the same time, the hypothesis 
of the linear theory that the stresses in the middle surface are zero is 
given up. The resulting equations of von Karman are the following 
pair of fourth order nonlinear differential equations for two functions, 
the deflection w(x, y) and a function ¢(x, y) from which the stresses in 
the middle surface, often called the “membrane” stresses, are derived: 


(3) NAAw = F(x, y) + — —-—2 +— —>}, 
Oy? dx? Oxdy Axdy dx? dy? 
d?w d*w 
(4) AAg = E 
Oxdy Ox? dy? 


The quantity h is the thickness of the plate and E is a material con- 


a 
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stant. The stresses o/, Tzy, ¢,, in the middle surface of the plate 
(Figure 2) are obtained from ¢ by the formulas 
¢; = 
oy? 


(5) 


The boundary conditions for w are linear ; they remain the same as for 
the linear bending problem. At a free edge, for example, the Kirchoff 
conditions are to be satisfied. Boundary conditions for ¢ are obtained 
from (5) in accordance with whatever “horizontal” forces p are as- 
sumed at the boundary of the plate (Figure 2), the term “horizontal” 
forces being used to distinguish such forces from the “vertical” forces 
F. It is easily shown that this is equivalent to prescribing the values 
of ¢ and its normal derivative df/dn at the boundary of the plate. 


4. Buckling problems. A special class of problems is obtained from 
(3) and (4) by assuming F=0; this we shall do from now on. In other 
words, there are assumed to be no lateral forces to cause bending. In 
addition we take always homogeneous boundary conditions for w. 
The problems that arise under these circumstances we refer to as 
buckling problems. 

For the sake of simplicity we assume also that the horizontal forces 
at the boundary are normal compressive forces which depend linearly 
on a factor X. This is equivalent to taking for @ the boundary condi- 
tions 


(5). dp/dn = d¥, 


where ® and ¥ are given functions defined at the boundary of the 
plate and ) is a factor proportional to the applied pressure. Under 
these circumstances, it is clear that w=0 is always a solution of (3), 
since w is assumed to satisfy homogeneous boundary conditions. This 
is also the unique solution for w when X is small enough (in other 
words, when the applied compressive forces are small enough). How- 
ever, there is always a critical value \. of \ at which the plane state 
becomes instable and the plate bends, or buckles, in the engineering 
terminology. Mathematically this means that a bifurcation of the 
solutions (cf. Figure 4) takes place for \=X. and solutions appear for 
which w is not identically zero. In this behavior we are reminded of 
linear eigenvalue problems, though the theory of such problems is 
naturally not applicable here. Nevertheless it can be shown that the 
lowest value \. of X for which a bifurcation takes place (or, for which 
buckling just begins) can always be found from the linear eigenvalue 
problem which results when the right-hand side of (4) is set equal to 
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zero. This implies that the solution ¢6=Adg@po of (4) does not depend 
upon w, and hence 


ay? ax? ax? dy? 


2, 2 2, 2 2, 2 
together with linear homogeneous boundary conditions for w then 
presents a linear eigenvalue problem from which the lowest value of 
for a bifurcation of the solution can be determined.! 

As in the case of the linear bending problem, a considerable litera- 
ture has grown up around the eigenvalue problems posed by (6) and 
many special cases have been treated. Many difficult special cases of 
practical importance were solved, for example, by Timoshenko [15]; 
the method used by him bears strong resemblance to the Ritz method. 
The solutions of these problems have, in general, the properties one 
is accustomed to associate with linear eigenvalue problems: they pos- 
sess a discrete spectrum and the eigenfunctions form a complete sys- 
tem in terms of which “arbitrary” functions can be developed. 

The problems of existence and uniqueness of the solutions of the 
eigenvalue problems associated with (6) have been settled in the 
main. These questions have, however, been reconsidered in recent 
years by A. Weinstein [17, 18] from a new point of view. Weinstein 
has shown that certain classes of plate problems (vibration problems 
as well as buckling problems) are equivalent to an appropriately 
chosen sequence of second order membrane problems. An interesting 
by-product of Weinstein’s work is that his method furnishes for the 
eigenvalues lower bounds which can be calculated numerically. 

If a plate is proportioned in such a way that it collapses when com- 
pressed by loads only slightly greater than that at which buckling 
just begins, the linear theory of buckling is evidently sufficient for 
practical purposes. But very thin metal plates of the type used in 
various modern structures do not fail even when the applied pressure 
is many times greater than that at which buckling begins. As a con- 
sequence, it is a problem of practical as well as mathematical interest 
to pursue the solutions of equations (3) and (4) into the nonlinear 
range for high values of the ratio A/A.. In fact, it is highly desirable 
to investigate if possible the asymptotic behavior of the solutions as 
A/A. tends to infinity. 

The most important case from the practical point of view would be 
that of the rectangular plate. It is clear that the numerical solution 


1 This approach to equation (6) is not the historical one; equation (6) was derived 
by G. H. Bryan [1] before the von K4rm4n equations were known. 


254 J. J. STOKER [April 


of the boundary value problem posed by equations (3) and (4) in this 
case under prescribed boundary conditions and for all values of X is 
one of very great difficulty. Some approximate solutions (for example, 
see [11]) have been given by the energy and perturbation methods, 
the latter being, roughly speaking, a solution in the form of a power 
series development with respect to \—A.. It is difficult to estimate the 
accuracy of such solutions, but it seems certain that they are accurate 
only for rather low values of the ratio A/A¢. 


5. The buckled plate as a ‘‘boundary layer” problem. In view of 
the above remarks, it would seem highly desirable to investigate 
rigorously and in detail, even if only in a quite special case, the nature 
of the solutions of (3) and (4), with F=0, in their dependence on X 
for an unlimited range of values of the ratio A/A,. One would then be 
in a better position to judge the degree of accuracy to be expected in 
applying approximation methods, such as the perturbation method, 
to other cases not easily solved explicitly. K. Friedrichs and the au- 
thor have found and solved such a special case [2, 3, 4]. 


Membrane stresses stresses 


Fie. 3 


The special problem in question is that of the circular plate with 
constant radial pressure applied at the boundary and with radial sym- 
metry assumed (Figure 3). In this case all quantities depend only 
upon the distance r from the center of the plate and the von Karman 
equations become ordinary differential equations.? Upon introduction 
of the quantities p= (1/r)(d¢/dr) —the radial membrane stress—and 
q= —(R/r)(dw/dr) in place of ¢ and w, respectively, the von Kaérman 


2 Solutions of the von K4rm4n equations for the problem of the bending of the 
circular plate under vertical forces with radial symmetry assumed have been given by 
Stewart Way [16]. Way found that the von K4rm4n equations in this case (in spite 
of their nonlinearity) can be solved conveniently by power series in r. 
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equations reduce to the following pair of nonlinear differential equa- 
tions of the second order: 


(7) 1°Gq + pq = 0, 

(8) Gp = q°/2, 

in which G is the linear operator R*r—*d/dr(r*d/dr). The quantity 7? 
occurring in (7) is given by n?=h/R, hand R being the thickness and 
radius of the plate and y a material constant. At the center r=0 the 


following regularity conditions resulting from the assumed symmetry 
are imposed: 


(9) dq/dr = dp/dr = 0 for r = 0. 
At the boundary r=R we assume 

(10) Rdq/dr + (1 + »)q = 0, 

(11) p=r>0. 


Condition (10) states that the radial bending stress o; is zero at the 
edge, while the important quantity \ in (11) is the prescribed radial 
pressure at the edge. 

Just as in the general case, the only solution of the boundary value 
problem formulated in equations (7) to (11) is g=0, p(r) =const. =i, 
if \ is small enough. The lowest value i, of \ for which a different solu- 
tion appears can be calculated from the linear eigenvalue problem 
arising from (7) and the homogeneous boundary conditions for g, with 
p= in (7) as parameter to be determined. The investigation for 
>A, requires the solution of the nonlinear boundary value problem 
as formulated above. 

The most striking feature of the numerical solutions in their de- 
pendence on ) is that p and g (and, in fact, all quantities) appear to 
tend with increasing \ to become constant in an increasingly large 
portion of the interior of the plate and to change rapidly in a narrow 
strip, or “boundary layer,” near its edge. In addition, p tends to a 
negative constant in the interior of the plate; physically this means 
that the compressive stress applied at the edge of the plate (compres- 
sive stresses are assumed positive here) results in tension at the center 
due to buckling and consequent stretch of the plate. To substantiate 
these statements mathematically it is of course necessary to formulate 
and solve the limit problem for \ tending to infinity. 

There are two distinct limit processes involved, the simpler to deal 
with being that concerning the limit state in the interior of the plate. 
One obtains the formulation of this problem from the original differ- 
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ential equations (7) and (8) by observing that to allow A to tend to 
infinity for a plate of fixed radius and thickness is mathematically 
equivalent to allowing the quantity »?=yh/R in (7) to tend to zero, 
X being held fixed. The result is the limit system: 


Gp = 
pq = 0. 


The only solution of these equations which satisfies the regularity 
conditions at r=0 is g=0, p=const. However, the quantity p cannot 
be set equal to the positive constant prescribed at the boundary, as 
the value of p in the interior of the plate seems certainly negative in 
the limit. The limit problem for the interior of the plate is thus not 
self-contained, since it fails to provide the value to be assigned to p 
in the limit. 

The limit solution for the interior of the plate can be completed 
only after the limit problem connected with the boundary layer phe- 
nomena has been formulated and solved. It should be pointed out 
that the occurrence of such boundary layer phenomena is not con- 
fined to the special problem under consideration here.? Phenomena of 
the same nature have been noted in various problems concerning thin 
curved shells (see, for example, [13 ]). Boundary layer problems occur 
also in certain nonlinear vibration problems, for example in those con- 
cerning what are called relaxation oscillations. All such problems have 
the same general aspect: Each is a boundary value problem associ- 
ated with a system of differential equations containing a parameter X. 
A limit solution is desired for a value of \ which causes the order of 
the system of differential equations to degenerate. The consequence 
is that some of the original boundary conditions are lost, with the re- 
sult that the convergence as regards becomes nonuniform at the 
boundary. 

Not all such problems are nonlinear. The problem of Kirchoff- 
Kelvin-Tait discussed earlier ought to be formulated as a boundary 
layer problem. The starting point would be the thick plate problem 
and the limit problem that obtained upon allowing the thickness to 
approach zero in the differential equations. The differential equations 
would degenerate and some boundary condition would be lost at the 
edge. It should be possible to find the nature of the limit state in the 
boundary layer, at least in some special cases. 

No general theory for boundary layer problems exists, but the gen- 


(12) 


3 The first and most famous boundary layer problem is the one discovered and 
treated by L. Prandtl [12] in studying the flow of a viscous fluid around an obstacle. 
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eral scheme of attack can be illustrated by means of the case of the 
buckled circular plate. The solution of the problem in this case, as in 
others, requires the introduction of a new independent variable in 
place of r which depends upon the parameter \ in such a way that 
the width of the boundary layer, as measured in the new scale, does 
not shrink to zero when X tends to infinity. At the same time new de- 
pendent variables, replacing p and g and also depending upon X, are 
introduced in such a way that the transformed differential equations 
resulting from (7) and (8) do not degenerate when J tends to infinity. 
In short, it is necessary to introduce new variables which have the 
effect of making the convergence uniform. Limit boundary conditions 
must also be obtained. The result is in the case of the circular plate 
a new nonlinear boundary value problem which can be solved to 
yield limit values for all quantities, including those of most practical 
importance—the maximum stresses. The maxima of both bending 
and membrane stresses occur, by the way, in the boundary layer in the 
limit. Once limit values for all quantities in the boundary layer have 
been found, we are provided with the particular quantity needed to 
complete the solution of the interior limit problem—this is, quite 
naturally, the limit value of p at the inner edge of the boundary layer. 

It may be mentioned that the occurrence of the boundary layer 
phenomena in the case of the circular plate was first observed by solv- 
ing the original differential equations for a series of increasing values 
of \. Only afterwards were rigorous proofs of the above statements 
worked out. I should like at this point to call attention to some re- 
marks made by von K4rman in his Gibbs Lecture relative to the need 
and value, even to those whose interest is mainly a practical one, of 
existence, uniqueness, and convergence proofs in cases where it is not 
obvious that the physical phenomena have received a correct mathe- 
matical formulation. 

Proofs that the limit situation in the buckled circular plate is as 
described above can be carried out through a confrontation of the 
boundary value problem with a minimum problem of the following 
type: The absolute minimum of the functional V, (essentially the po- 
tential energy) given by 


(13) Valg] = D[g] — + 


is to be found, where D, H, and K are integrals, the first two quad- 
ratic in g and dq/dr, the third of fourth degree in dg/dr. The minimum 
of V, is sought in the class of all continuous functions g with L?-in- 
tegrable derivatives for which the integrals are finite. It is shown, by 
direct methods, that the minimum of V) is taken on for a certain such 
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function (for each X, including A= ©) and that each such function 
possesses a continuous second derivative and furnishes a solution 
(though by no means all solutions) of the boundary value problem. 
The relation between the solutions of the minimum and boundary 
value problems is indicated schematically in Figure 4 which shows the 


w 
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deflection w of the center of the plate in its dependence on X. The 
heavy curve indicates the type of solution of the minimum problem. 
(That the solution —w always occurs with +w is readily seen.) How- 
ever, the boundary value problem has many other solutions, indi- 
cated by the dotted curves, in addition to w=0, which is always a 
solution. The solutions of the boundary value problem which do not 
vanish identically and which are not solutions of the minimum prob- 
lem begin at values of \ corresponding to the higher eigenvalues of 
the linearized buckling problem. 


6. Some unsolved problems. Quite a number of unsolved questions 
can be raised at once, even for the special case of the buckled circular 
plate. For example, it is possible that the solution of the minimum 
problem eventually becomes instable for a certain value of \ and that 
a new bifurcation arises if the restriction of radial symmetry is given 
up so that the plate can buckle in waves relative to the original 
buckied state. Experiments on buckling of rectangular plates indicate 


/ 
wzo / 
A 
A, A,\ A,\ 
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that such a “second buckling,” or buckling of higher order, may well 
occur. Also, it would be of interest to characterize as minimum prob- 
lems those solutions of the boundary value problem which begin at 
the higher eigenvalues of the linearized problem. As is well known, a 
similar question is solved in the case of linear eigenvalue problems by 
imposing orthogonality conditions, or by the formulation of a mini- 
mum-maximum problem. One could also inquire about the limit state 
in the case of the circular plate when forces perpendicular to the plane 
of the plate are applied, with various prescribed boundary conditions. 
It is known, for example, that the boundary layer effect for the mem- 
brane stress does not occur if no compressive stresses arise at the 
boundary of the plate. 

From the practical point of view it would be of high interest to 
carry the solutions through for the case.of the buckled rectangular 
plate. Even the asymptotic boundary layer solution alone would be 
of value. The problems with two independent variables seem, how- 
ever, to be of great difficulty.‘ 

One could go on to give a lengthy list of unsolved problems con- 
cerning the flexure of plates, some of which have never even been 
formulated mathematically. For instance, a theory of thin plates 
analogous to the elastica theory for thin rods ought to be worked out. 
In such a theory the assumptions of small deflections and slopes of 
the middle surface would be given up. To complete the analogy with 
the elastica theory, it should be assumed that the middle surface is 
unstretched, that is, that it remains a developable surface. In this 
case the right-hand side of the equation corresponding to (4) would 
be zero, since the right side of (4) is essentially the Gauss curvature 
of the middle surface. One would have to pay for this simplification 
by using the exact expressions for the principal curvatures of the 
middle surface. 

There remain unsolved problems of a purely mathematical charac- 
ter among the classical linear bending problems. The solutions of such 
problems are in general analytic functions with certain singularities, 
the character of which is known for a variety, though not all, of the 
commoner physical situations. The singularity at the corner of a 
clamped rectangular plate ought to be determined. The numerical 
solution of the previously mentioned problem of the rectangular 


4 Some years ago von K4rm4n, Sechler, and Donnell [7] gave what can be re- 
garded as a rough approximation to the asymptotic solution for the rectangular plate 
based on physical assumptions which correspond in a general way to what was found 
later in the boundary layer theory for the circular plate. 
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plate resting on rigid supports but with the corners not held down 
would be greatly facilitated if the singularity at the point of detach- 
ment of the plate from its support could be explicitly determined. The 
best tools for attacking these problems are probably to be found in 
complex function theory. 

A theory with some claim to generality for boundary layer prob- 
lems would be of high interest. Such a theory should be designed to 
answer a number of questions. In the first place, one does not know a 
priori which of the boundary conditions will be lost in a specific prob- 
lem. This question is coupled with another: In some cases, as in that 
of the circular plate, it is the boundary layer problem which is self- 
contained, while the interior limit problem can be solved completely 
only after the boundary layer problem has been solved. In the case 
of the Prandtl boundary layer problem the circumstances are re- 
versed; here the solution of the interior limit problem furnishes a 
quantity which is needed to complete the solution of the boundary 
layer problem. It is admittedly not easy to see how a general theory 
capable of answering such questions should be constructed ; probably 
it is necessary to begin, in the time-honored way, by collecting experi- 
ence through actual solution of a variety of carefully chosen special 
problems. Some physical problems likely to be of interest in this con- 
nection have already been mentioned—the Kirchoff-Kelvin-Tait 
problem, for example. 

The purpose of the addresses on applied mathematics, of which this 
is the first, is to stimulate interest in applied mathematics not only for 
the furtherance of practical ends but also for the advancement of 
mathematics as a whole. A striking illustration of the beneficial effects 
on pure mathematics of the study of the applications is certainly pro- 
vided by the topic of this address. One realizes, then, how great the 
loss to mathematics in solidity, richness, and variety would be if ap- 
plied mathematics generally were to be neglected. 
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ARITHMETIC OF ORDINALS WITH APPLICATIONS TO 
THE THEORY OF ORDERED ABELIAN GROUPS 


PHILIP W. CARRUTH 


1. Introduction. The operations of addition and multiplication of 
ordinals do not behave as well as one might desire. For example, the 
commutative laws are not valid, and the distributive law is valid on 
only one side. Consequently, we make definitions of a sum and prod- 
uct that do not have such defects. 

A binary operation, a@§, on ordinals is termed a natural sum if 
a @6 is a well-determined ordinal for any two ordinals, a and 8, such 
that: 

(1) 

(2) (a@8) 

(3) a@0=a, 

(4) 5@a>5@6 if and only if a>8, 
where 6 is any ordinal. 

Throughout this paper, o(a, 8) will denote the natural sum defined 
by Hessenberg.? It is the unique natural sum satisfying the condition 
that w*-m+o%-n=a(w*m, w*n), where a and 8 are any two ordinals 
such that a28, and where m and m are any two positive integers. 
a(a, 8) shall be shown to be the “smallest” natural sum, and it shall 
be shown to be the best bound for the order type of the join of two 
well-ordered subsets, of respective order types a and 8, of an ordered 
set. 

A binary operation, a @§, on ordinals is termed a natural product if 
a @6 is a well-determined ordinal for any two ordinals, a and 8, such 
that: 

(1) a@B=B@a, 

(2) 

(3) a@1=a, 

(4) a@6>8@é if and only if a>, 

(5) c(a@B, 5), 

(6) =w’7, 
where 6 is any ordinal, and where y =y(a, 8) is a suitable ordinal. 


Presented to the Society, September 5, 1941; received by the editors June 6, 1941. 
The writer is indebted to Professor Reinhold Baer for his advice in the preparation of 
this paper. 

1The writer presupposes familiarity with the material on ordinals found in 
F. Hausdorff’s Mengenlehre. 

2G. Hessenberg, Grundbegriffe der Mengenlehre, Abhandlungen der Fries’schen 
Schule, (n. s.), 1.4, Gottingen, 1906, no. 75. 
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It may be noted that the ordinal y above is a natural sum of a 
and B. If we impose the condition that this ordinal be equal to o(a, 8), 
we obtain the unique natural product defined by Hausdorff.* This 
particular natural product will be denoted by 2(a, 8) throughout this 
paper. We shall show that z(a, 6) is the “smallest” natural product 
and that it is the best bound for the order type of a certain rectangular 
array of ordered elements that has “a rows” and “8 columns.” An- 
other application of Hausdorff’s natural product will be given in 
determining a bound for the order type of the semi-group generated 
by a well-ordered set of positive elements in an ordered Abelian 
group. 

It is known that every ordinal a may be represented in the form, 
>a, 20. Henceforth in this paper, when an ordinal is written in sum- 
mation form, it will be assumed that the summation satisfies the 
above requirements. 


2. Natural sums and products of ordinals. We first prove the follow- 
ing theorem. 


THEOREM 1. Let r@p be any natural sum. 

I. Let T and R be well-ordered subsets, of respective order types t and 
p, of an ordered set of elements. Then the set of elements in the join of R 
and T, that is, R+-T, is well-ordered and has order type less than or 
equal tor @p. 

II. There exists a well-ordered subset R, of order type p, of an ordered 
set, and a well-ordered subset T, of order type rt, of the same ordered set, 
such that the order type of R+-T is exactly a(p, rT). 

III. o(p,7)S7T@p. 

IV. r-+ps7r@p.5 


ProoF oF I. I is true for r=0 and for all p. Assume it to be true 
for all p and for all 7 less than a. It is true for p=0 and r=a. Assume 
it to be true for all p less than 8 and t=a. We shall now prove I to 
be true for p=8 and r=a. 

It is easily verified that R+T is well-ordered for p=8 and r=a. 
Suppose the order type of R+T is greater than B@a. Then R+T has 
a segment Z of order type B@a. By our induction hypothesis, it is 


3 F. Hausdorff, Mengenlehre, 3rd edition, p. 70. 

4 For a definition of an ordered Abelian group, see S. MacLane, The uniqueness of 
the power series representation of certain fields with valuations, Annals of Mathematics, 
(2), vol. 39 (1938), p. 371. 

5 o(7, p) may be larger than the maximum {r+p, ptr } , for example, let r=p=w-+1. 
Then r-+p=p+7=w2+1, and o(p, r) =w2+2. 
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seen that the order type of the subset of elements of T that are in Z 
must be a, since a@B >u@B for a>p. Likewise the order type of the 
subset of elements of R that are in Z must be 8. The elements of R 
that are in Z form a segment of R, and the elements of T that are in Z 
form a segment of T. Hence the set R+T is the same as the segment 
Z of R+T. This gives us a contradiction, proving I. 

Proor oF II. Let wa;, and Then o(p, 7) is 
equal to >.” ,w**(a;+b;). Let R be a set of elements of order type p. 
Then R is the join of sets R:;, 1 Sin, where R; has order type w%‘a;, 
and where each element in R; is smaller than every element in Ri+1. 

Let T be a set of elements of order type r. Then T is the join of sets 
T;, 1 Sin, where T; has order type w%‘b;, and where each element 
in T; is smaller than every element in 7,41. 

By letting each element in T; be greater than every element in R; 
and smaller than every element in Rjs1, we form a set R+T of order 
type a(p, 7). 

III is an immediate consequence of I and II, and IV is an immedi- 
ate consequence of I. 

An ordered set of elements, A = {das} , 0Sa<r, OSB <p, where 7 
and are any two ordinals, is an ordered 1, u-block if dag <@ay, Opa 
for 8<-y. An ordered 1, u-block A is said to be less than an ordered p, 
d-block B, that is, A <B, if each element in A is considered to be less 
than every element in B. Two ordered 1, y-blocks, A = {aas} and 
B={b.s}, are congruent if the following two conditions are equiva- 
lent: (1) d@ag<@px, (2) 


THEOREM 2. Let 7 @y be any natural product. 

I. Let A= { Gas} be an ordered 7, u-block. Then A is well-ordered and 
has order type less than or equal to T@u. 

Il. There exists an ordered u-block of order type exactly 

III. S7@u. 

IV. tuS7@u.® 


Proor oF I. Suppose that A is not well-ordered. Then there exists a 
subset B of A, @a,8, >@a.8,> ---, Of order type w*. Since the a; are 
well-ordered, we may choose a chain C, @3,,,>d3,7,> ---, of order 
type w*, from the set B such that 6,5 &< - - - . Since the y; are well- 
ordered, there exists a y;, say such that 21. But then 
which is a contradiction. 

The set A has order type less than or equal to r@y for r=1 and 
all u. Assume this to be true for all u and for all 7 less than 6. It is 


6 x(p, 7) may be larger than the maximum {rp, pr} , for example, let r=p=w+1. 
Then and =e? +0241. 
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true for 4=1 and r=6. Assume it to be true for all u less than y and 
t= 46. We shall now prove A has order type less than or equal to T@yu 
for and r=6. 

Let 6=w%m+ --- +w%m, and -- - +wen,. 

Case 1. m,>1 or r>1. 

Let p=wm,+ -- - +w*%(m,—1), where m,—1 20. By hypothesis, 
6>p>O0and 6>w*. 

Let 7; be the set {aes}, where 0 Sa<p and 0<8<yv7. Let T2 be the 
set {aus} , where pSa<6 and 0<8<y. Then A is the set 71+7>. By 
our induction hypotheses, the order type of 7; is less than or equal 
to p@y. Likewise, the order type of T> is less than or equal to w*"@y. 
Theorem 1 implies that the order type of A is less than or equal to 
a(p@y, @y) =7 @a(p, 

Because of the symmetry, A has order type less than or equal to 
for and if or s>1. 

Case Il. my=r=m=s=1, that is, 6=w% and y=w*!. 

Suppose that the order type of A is greater than 6@y. Then A has 
a segment Z of order type 5@y. Let A =Z+ Y. Let a), be the smallest 
element in Y. Then 6>A and y>e. 

Let T; be the set {aag}, where OSa<é and 0<B<e. Let T: be the 
set {a.s}, where 0Sa<d and where e<8<y. Let W=Ti+72. 


Goo, Goi, *** » Bde, °°” 


The set at the left of the dotted line is 71, and that in the upper right- 
hand corner is 72. 

W certainly contains Z. Therefore, according to our supposition, 
the order type of W is greater than or equal to 5@y. 

By our induction hypothesis, the order type of 7; is less than or 
equal to 5@e, and the order type of T> is less than or equal to y@d. 
Theorem 1 implies that the order type of W is less than or equal to 
a(5@€, y@d). According to (6) of the definition of a natural product, 
5@y=w", where 7 is some ordinal. Likewise, by (4) of this definition, 
it is seen that 5@€<w’ and that y@A<w". The definition of o(a, 8) 
implies that (5@€, y@A)<w"=5@y. This then isa contradiction 
which completes the proof of I. 

Proor oF II. Let r=w%a1+ - - +w%a,and p=w*th,+ - - - +whb,. 


266 P. W. CARRUTH [April 


Then =>. oc, where ,*aib;, where 
is over the finite number of pairs such that o(a;:, B;) =x. 

Any ordered 1, u-block, A = {a.s}, may be broken up into ordered 
wia;, ~blocks, D(i, j) = {aes}, where w%a;+ --- 
<w%a;+ +w%a; and SB < + eee 
+w*ib;. 


Ssh, 
1) D(A, 2) s) 
wd, D(2, 1) D(2, 2) eee D(2, s) 

wa, D(r, 1) D(r, 2) tee D(r, s) 


Suppose that we could order our blocks D(i, 7) in such a way that 
each D(i, 7) would have order type exactly 1(w*‘a;, wib ;) 
By then letting D(i, 7) be less than D(k, m) if o(a:, Bj) >o(ax, Bm) or 
if o(a:, B;)=a(ax, Bx) and we would have an ordered 1, u-block 
of order type exactly 1(7, u). Thus it is seen that it suffices to prove II 
for the special case that t=w*a and p=wd. 

Hence we now assume that r=w%a and that p=wb. Let 
--- +w7f, and --- +w'd,. We first assume 
that ¢=0. In this case a=0 and =a. If we let for it is 
seen that an ordered 1, u-block is formed of order type ur =7(u, 7). 
Assume that II is true for all integers ¢ less than & and for all integers 
v. We shall now prove II to be true when t=k. 


= we wit ee 


Then 


wt = = 


cP = -+udedy 


Let 
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where 6,27: > 6.41, 0SeSv. Let 
If e=0, let \=1. If e=v, let »=1. 
wb) = ta, dnd) 
= d), ab 
= 


DIAGRAM 
Fo 
An 
Do > 
G, 
A => 
w* Fi 


By our induction hypothesis, there exists an ordered ¢, \-block D, 
of order type 4). We may construct congruent ordered ¢, 
blocks Dz, 0Sa<w*%%, as in the Diagram, such that D,<Dg for 
a<B. 

Let Ep be the block of all the D,. Ey is an ordered w, \-block. The 
order type of Ep is +($, X)-w*%/. We may now construct congruent 
ordered w*, A-blocks Es, 0S8<7n, as in the Diagram, such that 
Es<E,;, for B<6. 


= 
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Let Fy be the block of all the Eg. Then Fo is an ordered w*, \n-block. 
The order type of Fo is 


d) - 


Now we construct congruent ordered w*, Ay-blocks F;, 0<i<a, such 
that F;< F;, for i<j. 

Let Go be the block of all the F;. Then Go is an ordered w%a, An- 
block. The order type of Go is 1(¢, \)w*™e- na. We construct con- 
gruent ordered w%a, An-blocks G;, 0<i<6, as in the Diagram, such 
that G;<G;, for i<j. 

Let A be the block of.all the G;. A is an ordered wa, w®b-block. The 
order type of A is 


III is an immediate consequence of I and II, and IV is an immedi- 
ate consequence of I. 


3. Well-ordered subsets of ordered Abelian groups. We prove the 
following theorem. 


THEOREM 3. Let G be an ordered Abelian group. Let T be a well- 
ordered (according to size) set of non-negative elements in G, containing 0, 
of order type B=1-++-n, where rt ts a limit ordinal or 0, and n ts an integer 
greater than or equal to 0. Let M be the set of all finite sums of elements 
in T. Then: 

I. M is well-ordered. 

II. M has order type less than or equal to r(w®, r7).7 


Proor oF I. 

Case 1. There exists an integer N such that each element in M is 
the sum of N elements in T. 

Let M; be the set of sums of at most z elements in T. Then M; is 
well-ordered if i=1. Assume that M;,_: is well-ordered. Suppose that 
M, is not well-ordered. Then there exists a chain of elements in M;, 
a>a.> ---, of order type w*. Let a;=c;+d;, where c; is in My-1, 
and d; is in M,=T. Then a+d:>c.+d.>---. The c; are well- 
ordered. Likewise the d; are well-ordered. Hence this chain can not 
exist, that is, M; is well-ordered. 

Case 2. There exists no integer N such that each element in M is 
the sum of WN elements in T. 

Let T be the set {ta } , where t, <ts for a<f. Let D, be the subgroup 
of G that is generated by /, and all g in G such that 0<g<?,. Then 


7 If B is an integer, then a better bound of the order type of M is r(w®, 77) =w®-, 


1942] ARITHMETIC OF ORDINALS 269 


0=D,.sDisD2s ---. We pick out the well-ordered set R of all the 
different D,. R= { Ro, eee };0=Ry)<Ri< SG. 

Let M(a) be the set of all elements in M such that each may be 
expressed as a finite sum of elements in T, all of which are in R.. 

M(0) consists of the single element 0 and so is well-ordered. Assume 
that M(q) is well-ordered for all a<y. Suppose that M(7) is not well- 
ordered. Then there exists a chain in M(y), a1>a2> ---, of order 
type w*. 

(a) Only a finite number of the a; are in M(y) but not in an M(a), 
a<y. 

In this case, we may throw out this finite number of a; and then we 
have a chain, ¢:>c2> ---, of order type w*, where each ¢; is in an 
M(a), a<vy. Let c; be in M(p), p<. By our induction hypothesis, 
there exists a c;, say c,, not in M(p). But then ¢,>c, a contradiction. 

(b) An infinite number of the a; are in M(7) but not in any M(a), 
a<y. 

In this case, we may choose a chain, ¢:>c2> - - - , of order type w*, 
where each ¢; is in M(y) but not in any M(a), a<y. 

Let c:=f:+g:, where f; is in an M(a), a<vy, and g; is a sum of terms 
in T, each of which is in R, but not in an R,, a<y. We saw in (a) 
that the f; must be well-ordered. Hence the g; must not be well- 
ordered, that is, we can choose a chain from the gi: ga, >a,>°°*» 
of order type w*. Each g.,; is a sum of terms in T, all of which are 
in R, but not in any R,, a<y. 

Let ¢ be the smallest positive element in T that is in R, but not in 
an R,, a<y. There exists an integer m such that nt is greater than ga,. 
According to Case 1, there must exist a g.,, Say Za;, that is the sum of 
more than 1 positive elements in T. But then g.;>mt>ga,, a contra- 
diction. 

Before we proceed, we shall prove this lemma. 


Lemma 1. Let G be an ordered Abelian group. Let T be a subset of 
non-negative elements in G, containing 0, that is well-ordered (accord- 
ing to size) of order type B. Let Hy be the set {>-1_,a:n;}, where the n; 
are non-negative integers such that )3.4niSk, the a; are in T, and k 
is any positive integer. Then Hj, is well-ordered and has order type less 
than or equal to = - - - (71(8, B), 8), - - - ), B), where the 
subscripts merely denote the number of ’s. 


Proor. H; is well-ordered, by Case 1 of Theorem 3. The order type 
of Hi is 8. Assume H}_, has order type less than or equal to ¢,_:(8). 
Let the elements in be do, ---, where for a< 5. 
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The set R= {batas}, all b. in Hj-1, and all elements a; in T, cer- 
tainly contains Hy. Theorem 2 implies that the order type of H, is 
less than or equal to 7(¢s_1(8), B) =o: (8). 

We now continue with the proof of Theorem 3. 

Proor oF II. Let M* be the set of all elements in M that are sums of 
elements ¢,, OSy<a, in T. Note that Let where 
Tq is a limit ordinal or 0, and mz is an integer greater than or equal to 0. 

M' has order type 1. Assume that M+, a<4, has order type less 
than or equal to r(w*, 73%). 

Case 1. ns>0. 

M*~' has order type less than or equal to (w*-', 7j*). M® is cer- 
tainly contained in the set { Qat+nts_1}, for all elements a, in M*-}, 
and 0<n< ©. Theorem 2 implies that the order type of M® is less 
than or equal to r(x(w*-!, 75*), w) = r(w', 77°). 

Case 2. ns=0, that is, 6. 

Suppose that M® has order type greater than r(w*, 5°). Then 
M*=S+R, where S is the segment of M® of order type rw’, 6°). 

Let b=t.,m+ <a,<5, be the smallest 
element in R, where the m; are positive integers. Let a,=u-+m, where 
» is a limit ordinal or zero and m is an integer greater than or equal 
to 0. Let m be an integer such that nf,, is greater than b. 

Let H,, be the set of all elements >i tani in M®°, such that rSn, 
di-iSn, and each B;<6. By Lemma 1, it is seen that the order 
type of H, is less than or equal to $,(6) = - 5), 8) 

Let and 6:=w%1+--- +wv, Then 
5 and D2, 

Let x be any element in S. Then x=A.n+ + 
+---+hy. where <A, <a. < <A. <5. Let 
+---+hy,,and g=h,,, +h re. Then fis in M. Since x is less 
than 3, it follows that g is less than b. Hence 7,41+ +--+ +r.-Sm. And 
so g is in H,. Therefore the set S is contained in the set {b.+c,}, 
for all elements b, in H,, and for all elements c, in Ms. Theorem 2 
and our induction hypothesis imply that S has order type less than or 


(1) &), = (5, 5’)), where = + mis an integer, 


(&, 5:>0, since 6 is a limit or- 
dinal. Hence (w*®t(e:+1))*=w*™, since p is a limit ordinal. (w*!(e:+1))é 
=w*1i(e,+1), 


(2) 54) + 1)) = 4 1). 
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517) < + 1)) < + wt! = + bw. 


By (2), 54) = < puto 68, 
Hence by (1), the order type of S is less than r(w’, 5°). This is a con- 
tradiction since S was the segment of M® of order type r(w’, 65°). 
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A CHARACTERIZATION OF ABSOLUTE 
NEIGHBORHOOD RETRACTS 


RALPH H. FOX 


By an absolute neighborhood retract (ANR) I mean a separable 
metrizable space which is a neighborhood retract of every separable 
metrizable space which contains it and in which it is closed. This 
generalization of Borsuk’s original definition! was given by Kuratow- 
ski? for the purpose of enlarging the class of absolute neighborhood 
retracts to include certain spaces which are not compact. The space 
originally designated by Borsuk as absolute neighborhood retracts (or 
9-sets) will now be referred to as compact absolute neighborhood re- 
tracts. Many of the properties of compact ANR-sets hold equally for 
the more general ANR-sets.’ 

The Hilbert parallelotope Q, that is, the product of the closed unit 
interval [0, 1] with itself a countable number of times is a “universal” 
compact ANR in the sense that* every compact ANR is homeo- 
morphic to a neighborhood retract of Q. The classical theory of 
Borsuk makes good use of the imbedding of compact ANR-sets in Q. 
The problem solved here is that of finding a “universal” ANR. 


Received by the editors June 28, 1941. 

1Fundamenta Mathematicae, vol. 19 (1932), pp. 220-242. 

2? Fundamenta Mathematicae, vol. 24 (1935), p. 270, Footnote 1. 

3 Ibid., pp. 272, 276, and 277, and Footnote 1, p. 279 and Footnote 3. Note that 
Theorem 12, Fundamenta Mathematicae, vol. 19 (1932), p. 229, is not true for gen- 
eral ANR-sets. In fact let A =)_S, where S, is the plane circle of radius 2-" and center 
(3-2-*, 0); let f(x, y) =(x, ||) for (x, y)EA and let 

|y|), for (x, y) EA — Su, 
(x,y), for (x, y) Sn. 
Then f,—f in A4; f can be extended to the half-plane {x>0}, but none of the maps fa 
can. A isan ANR-set. Theorem 16, Fundamenta Mathematicae, vol. 19 (1932), p. 230, 
is also false for general ANR-sets. 
4 Fundamenta Mathematicae, vol. 19 (1932), p. 223. 
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Strictly speaking, the problem as just stated has no solution; there 
is no single “universal” ANR, but rather a whole class of ANR- 
sets which together serve in the “universal” capacity. Such a class of 
ANR-sets is the collection of subsets of the Hilbert parallelotope 
Qx [0, 1] which contains the open subset* QX (0, 1] of Qx [0, 1]. 


THEOREM 1. For a separable metrizable space X the following three 
conditions are equivalent: 

(1) X is an ANR-set; 

(2) There is a homeomorphism f of X into Q such that f(X) X [0] is a 
neighborhood retract of {(X) X [0]+0x(0, 1]; 

(3) f(X) x [0] is a neighborhood retract of f(X)X[0]+0X (0, 1] for 
every homeomorphism f of X into Q. 


(1)->(3): If f is a homeomorphism of an ANR-set X into Q then 
f(X)X [0] is an ANR-set. Since Q is compact, so that 


X fo] Ce x 


it follows that f(X) X [0] is closed in f(X) xX [0]+0Qx(0, 1]. Hence 
f(X) xX [0] is a neighborhood retract of f(X) x [(0]+0x (0, 1]. 

(3)>(2): Since X is separable and metrizable a homeomorphism f 
exists by Urysohn’s theorem.* 

(2)->(1): Let M be a separable metrizable space containing X in 
which X is closed and let f be a homeomorphism of X into Q. By 
Tietze’s theorem’ there exists a continuous function g defined on M 
with values in Q such that. g(x)=f(x) for every «CX. Let M be 
metrized, with metric d, and let p(x) =min { 1, d(x, X )} for every 
xE€M. Let h(x) = (g(x), p(x)), so that h is a continuous function de- 
fined on M with values in f(X) X [0] +QX(0, 1] which has the prop- 
erty h(M—X)CQxX(0, 1]. Let V be a neighborhood of f(X) Xx [0] in 
f(X) x [(0]4+0x (0, 1] and let V=h-1(V) so that U is a neighborhood 
of X in M. If ris a retraction of V onto f(X) X [0] then the mapping® 
f-'xrh| U, where r denotes the projection of QX [0] onto Q, is a re- 
traction of U onto X. 

Kuratowski also gave an analogous generalization of the notion of 
absolute retract.? According to the extended definition a separable 
metrizable space is an absolute retract (AR) if it is a retract of every 
containing separable metrizable space in which it is closed. 


5 The symbol (0, 1] denotes the half-open interval 0<tS1. 

6 Alexandroff and Hopf, Topologie, p. 81. 

7 Ibid., p. 73. 

8 If BCB’ and ¢ isa function defined on B’ then the notation d=e| B means that d 
is the function defined on B such that d(x) =e(x) for every xE B. 
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THEOREM 1’. For a separable metrizable space X the following three 
conditions are equivalent: 

(1’) X is an AR; 

(2’) There is a homeomorphism f of X into Q such that f(X) xX |0] 
is a retract of f(X)X|0]+0x(0, 1]; 

(3’) f(X) X [0] is retract of f(X) X [0] (0, 1] for every homeo- 
morphism f of X into Q. 


The proof of this theorem is an obvious modification of the preced- 
ing proof. 


Coro uary. If C denotes the open n-cell O<x;<1 (¢=1,---,m) and 
D denotes the closed n-cell OS x;S1 (4=1,---,m) then any set E such 
that CCECD is an AR. 


By condition (2’) and a retraction of QX [0, 1] onto Dx [0, 1] it is 
sufficient to show that E X [0] is a retract of EX [0]+D X(0, 1]. This 
can be done by projecting from the point (1/2, --- , 1/2, —1) of Eu- 
clidean (n-+1)-space. 

It may be worth noting that conditions (2) and (2’) make possible 
a simpler proof of the Borsuk-Kuratowski® theorem(s): 


If W is a closed subset of a normal space Z and X is an AR-set 
(ANR-set) then every continuous map of W into X can be extended to Z 
(to a neighborhood of W in Z). 


In fact conditions (2) and (2’) replace a theorem of Kuratowski!® 
which involves infinite polyhedra. 


THEOREM 2. An ANR is locally contractible.1 An AR is also con- 
tractible. 


Using (2) we can suppose that our ANR-set Y is contained in 
Qx [0] and that there is a retraction r of an open neighborhood V of 
Yin Y+QX(0, 1] onto Y. But V is the intersection of Y+QX (0, 1] 
with an open set V’ of QX[0, 1]. Let yEY and let S, denote the 
e-sphere in QX [0, 1] about the point y. Since r is continuous there 
is a 5>0 such that the intersection 7; of the 6-sphere S; and 
(0, 1] is contained in V’, hence in V, and r(T3) CS,. Let de- 
note a contraction of S; to a point pES;-(QX(0, 1]) which moves 
points rectilinearly, so that u(x)€QX(0, 1] for every 0<#S1 and 


® Fundamenta Mathematicae, vol. 24 (1935), p. 275. 

10 Fundamenta Mathematicae, vol. 24 (1935), p. 266, Theorem 2. 

11 But not uniformly. See the example in Footnote 3. This theorem was proved by 
Borsuk, Fundamenta Mathematicae, vol. 19 (1932), p. 237 for compact ANR-sets. 
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yEY-S;. Then ru,| Y-S; contracts Y-S; in Y-S,. The second state- 
ment is a consequence of Theorem 3’. 


THEOREM 3. A separable metrizable space X is an ANR if and only 
if for every separable metrizable space M containing X (in which X need 
not be closed!) there is a neighborhood U of X and a continuous function 
h defined on XX[0]+UX(0, 1] with values in X such that® 
h| X [0, 1] és a deformation.** 


Suppose X is an ANR and M a separable metrizable space con- 
taining X. We may assume that MCQ. By (2) and (3) there is an 
open neighborhood V’ of XX [0|+@QxX(0, 1] and a retraction r of 
1]) onto XxX[0]. Let X(x)=d(x x [0], 
Q~x [0, 1]—V’) for every xEM and let U=2(V’-(QX [0])) where, 
as before, x denotes the projection of QX [0] onto Q. Define for every 
(x, 2)EX xX [0] +Ux(0, 1], 


h(x, t) 


ar(x, t), when ¢ S X(x), 
mr(x, A(x)), when ¢ = X(x). 


Since \ is continuous and A(x) >0 when x€ U it follows that h is con- 
tinuous. 

Conversely, let U be a neighborhood of X in M =Q and let h be a 
continuous function defined on X X [o]+ UX (0, 1] with values in X 
such that h(x, 0)=x for every x€X. Then h is a retraction of 
X x [0]4+Ux(0, 1] onto Xx [0]. Furthermore X x [0]+Ux(0, 1] 
is a neighborhood of X x [0] in X x [0] +0x(0, 1]. 


THEOREM 3’. A separable metrizable space X is an AR if and only 
if for any separable metrizable space M containing X there is a continu- 
ous function h defined on X X[0|+MX(0, 1] with values in X such 
that® h| X X [0, 1] is a contraction.’ 

Let X be an AR and M a separable metrizable space containing X; 
we may assume that MCQ. Let r be a retraction of X X [0]+Q 
x (0, 1] onto X x [0]. Let pEOQ and let 


h(x, t) = ar(tp + (1 — dx, B 


for every (x, t)}€XX[0]+Mx(0, 1], where z is the projection of 
Qx [0] onto Q. Then hk maps X x [0]+- Mx (0, 1] continuously into X 
and h| X x [0, 1] is a contraction of X. 

The converse is proved as in Theorem 3. 


12 A deformation of X is a continuous mapping h of XX[0, 1] into X such that 
h(x, 0) =x for every xX. If h(X, 1) is a point then h is called a contraction of X. 
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If X is locally compact the deformation h| X x [0, 1] of Theorems 3 
and 3’ can be chosen in advance of M. For then there exists a com- 
pact set M* and a homeomorphism g of X into M* such that 
M*—g(X) is a point. (We can suppose X not compact so that 
M*#g(X).) Let M*CQ. The homeomorphism g can be extended” 
to a continuous mapping g* of X into M* by defining g*(X — X) = M* 
—g(X). The mapping g* of X into Q can be extended, by Tietze’s 
theorem, to a mapping k of M into Q. In the case of Theorem 3 let h 
be the mapping of X x [(0]+-UX(0, 1] into X defined by 


h(x, t) = g-'ar(k(x), min {t, d(x) }), 


where U=g-'1(V’-(QX [0])). In the case of Theorem 3’ let h be the 
mapping of X x [(0]+.MxX(0, 1] into X defined by 


h(x, t) = gar(tp + (1 — R(x), 2). 


In both cases h| X X [0, 1] is independent of M. 

If X is not locally compact it may not be possible to pick a deforma- 
tion h|X x [0, 1] satisfying the conditions of Theorems 3 or 3’ for 
all M. An example is the AR-set y=0}+) Su{x=1/n; 
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13 Alexandroff and Hopf, Topologie, I, p. 93. 


= 


RESOLUTION OF BOUNDARY PROBLEMS BY THE USE 
OF A GENERALIZED CONVOLUTION 


R. C. F. BARTELS AND R. V. CHURCHILL 


1. The Laplace transformation of the convolution. The generalized 
convolution F*(t) of F(t, t’) is defined as follows: 


F*() = fr 


In case F(t, t’) = F,(t) F2(t’), the function F*(¢) is the ordinary convolu- 
tion F, * F2, or Faltung,! of the two functions F; and F». 
Let L{ F*(t)} denote the Laplace transform of F* with respect to #, 


L{F*()} = f (dl, 


and let f(s) denote the iterated transform of F(t, t’), 


(1) f(s) = f f 
0 0 
It will be seen that 
(2) LiF*()} = fis), 
which, in terms of the inverse Laplace transformation, implies that 
= F*(). 


THEOREM. Let F(t, t’) be an integrable function of t and t' in every finite 
rectangle O<t<T, 0<t' ST’ and, for some real a, let F(t, t’)| 
be bounded for all t and t’ (t=0, t’=0). Then if R(s) >a, the integral 
L{ F*(t)} is absolutely convergent and satisfies the equation (2). 


Under the conditions stated, the iterated integral in (1) exists if 
R(s) >a and is equal to the absolutely convergent double integral 


f f #’)d(t, 


over the quadrant #20, t’2=0. However, the latter is equal to 


Presented to the Society, April 12, 1940, under the title An extension of Duhamel’s 
theorem; received by the editors March 28, 1941. 

1G. Doetsch, Theorie und Andwendung der Laplace-Transformation, Berlin, 1937, 
p. 155 ff. 
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(3) lim ff et t/)d(t, 

where 7; is the triangle bounded by the lines t’ =0, =0 and t+#’ =k. 

After the substitutions r=t+1’, r’=t’ have been made, the integral 

in (3) is transformed into the integral over the triangle bounded by 

the lines r’=0, r=7’, and r=k in the 7r’-plane. Hence (3) can be 

written 


k 
lim e—**dr f F(r — 7’, 7’) dr’, 
0 0 


which is L { F*(t)}. Therefore the equality in (2) holds. 

The convergence of the integral L{ | F*(t)| }, that is, the absolute 
convergence of the Laplace integral in (2), follows immediately from 
the absolute convergence of the limit in (3). 


2. The Duhamel integral formula. Let A and \ denote the differ- 
ential operators defined as follows: 


a 
A{U} + (x =), 
Ox; 


=a + Da 


where the coefficients of U and its derivatives are functions of x1, x2, 
and x3 only. Then, if P denotes an arbitrary interior point (x1, x2, x3) 
of a region R, and Q any point on the boundary S of R, a general 
boundary value problem for the temperature U(P, #) in the region R 
can be written 


=A{U}+F(P,)), t>0, 
»{U@, =GQ@, 4, t> 0, 
U(P, 0) = A(P), 


where F, G, and H are prescribed functions. In the second of these 
equations it is understood that r{ U(Q, t)} represents the limit of 
\{ U(P, t)} as P approaches the point Q on S in a prescribed manner. 
Similarly, in the last equation U(P, 0) is written for U(P, +0). 
Applying the Laplace transformation with respect to ¢ to the first 
two equations in problem (A) gives, in view of the last condition, 


—r 
Om 
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su(P, s) — H(P) = A{u(P, s)} + f(P, 5), 
rA{uQ, s)} = 5), 


where u, f, and g are the transforms of U, F, and G, respectively. 

Let the temperature function V(P, #, t’), depending on the fixed 
parameter ¢’, be the solution of problem (A) when the source function 
F and the surface-temperature function G have at each point of R 
and S, respectively, the constant values F(P, t’) and G(Q, t’). Then 


(A’) 


V(P, t,t’) = A{V(P, 4, ¢)} + 


B 


V(P, 0, ’) = H(P). 


Also let o(P, s, t’) represent the Laplace transformation of V(P, t, t’) 
with respect to ¢, and i(P, s) the iterated transform of V(P, t, t’) with 
respect tot and ¢’. Then applying the transformation with respect tot, 
it follows from (B) that 


so(P, s, — H(P) = A{o(P, S, t’)} + Fe, ’), 


1 


and consequently, on applying the transformation with respect to t’ 
to these equations, that 


1 1 
si(P, s) — H(P) = A{a(P, s)} +—f(P, 5). 
(B’) ‘ 
= — 8(@, 5). 
Upon multiplying each member of the equations in (B’) by s, it is 
at once evident that the problems (A’) and (B’) are equivalent and 


that the function si(P, s) is a solution of (A’). Thus, assuming that 
the solution of (A’) is unique, it follows that 


(4) u(P, s) = si(P, s). 


According to equation (2), 


iP, s) = L{V*(P, )} = V(P,t— 7, 
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Moreover, since V*(P, 0) =0, 


= si(P 
= si(P, s). 


Hence it follows from equation (4), on performing the inverse trans- 
formation, that 


art 
(5) al. V(P, t — 7, 7)dr. 


The solution of the problem (A) with variable source and surface condi- 
tions is therefore given by formula (5) in terms of the solution of the 
problem (B) with constant source and surface conditions. 

Sufficient conditions can, of course, be given in order to justify all 
of the steps in the formal derivation of (5). However, more relaxed 
conditions can be obtained in particular cases by verifying directly 
that the function given by (5) is a solution of problem (A). When the 
latter procedure is applied to the above general case, it is seen that the 
foregoing result in italics is true, provided that V, 0V/dx, 02V/dx7, 
and their derivatives with respect to ¢ are continuous functions of 
P, t, and t’ interior to R when t=0 and ¢t’=0, and that the function 
(0/dt) {\{ V(P, t—7, r)}dr is continuous with respect to P at the 
points Q when />0. 

The relation (5) is known in the theory of heat conduction as 
Duhamel’s integral formula.? It has been shown above that this for- 
mula applies to a very general temperature problem. It is clear that 
the procedure can be applied in case of discontinuous media, and to 
other boundary value problems as well. It is, of course, applicable to 
problems with partial differential equations of higher order than the 
second. 


3. Resolution of temperature problems. It will now be shown that 
the general problem (A) can be further resolved into still simpler 
problems which are of two basic types. It should first be observed 
that the solution of problem (B) can be written 


(6) V(P, t, = Vi(P, + t, + WP, 
where Vi, V2, and W are solutions of the problems: 
(a) A{Vi(P,#)} =0, af{ViQ,#)} =GQ@,¢); 


2J. M. C. Duhamel, Mémoir sur la méthode générale relative au mouvement de le 
chaleur dans les corps solides plongers dans des milieux dont la température varie avec 
le temps, Journal de 1’ Ecole Polytechnique, vol. 14 (1830), pp. 20-77; also see H. S. 
Carslaw, Conduction of Heat, 1921, pp. 16-19. 
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t,t’) = A{V2(P, #, 
(b) at 2 2 


»{V2(Q, =0, Va(P, 0, = H(P) — Vi(P, 
and 


W(P,t,¢) = A{W(P,t, ¢)} + F(P, 


W(P,0,/) =0. 
In terms of the transform w(P, s, t’) of W(P, t, t’), the last of the 
foregoing problems becomes 


sw(P, s, t’) — re, rA{wQ,s, =0. 
s 


On multiplying each member of these two equations by s, it is im- 
mediately evident that the function sw(P, s, t’) is a solution of the 
transform of the following problem: 


— V,(P, t, = t, 
V3Q, t, t’)} = 0, V3(P, 0, t’) F(P, t’). 
Therefore, if vs(P, s, t’) denotes the transform of V;(P, ¢, t’), then 
w=v;/s and, consequently, 


(7) W(P, = f V3(P, 1, t’)dr. 


According to equations (5), (6), and (7), the solution of problem 
(A) is given in terms of the solutions V;, V2, and V3 of the problems 
(a), (b), and (c), respectively, as follows: 


U(P, t) = f [ va, 


+ f VAP, 7’, dr, 
0 


which can be written 


(P, (P, t 2 ’ 
V3(P, t — 7, r)dr. 


| \dr 
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Problems (b) and (c) are variable state problems of the same type in 
which the initial condition alone is nonhomogeneous. On the other 
hand, (a) is a steady state problem. The general temperature problem 
(A) is thus resolved by formula (8) into the solution of problems of two 
basic types, one of steady state, the other, of variable state. 


4, Resolution of vibration problems. A general form of the problem 
of displacements in elastic media with variable boundary conditions, 
including forced vibrations of membranes, shafts, and strings, can be 
written 

a 
— 1) + b— OP, t) = + F(P, 
ot? ot 
(C) AL )} 
0) = H(P), = HP). 


As before, P denotes a point in a region R, and Qa point on the bound- 
ary of R; F, G, H, I, and b are prescribed functions, b being a function 
of P alone. The resolution of this problem can, as in the case of the 
foregoing temperature problem, be easily obtained by formal applica- 
tion of the Laplace transformation. The problem corresponding to (C) 
in s), the transform of ®(P, is 


(s? + bs)¢(P, s) — (s + 6)H(P) — I(P) = A{o(P, s)} + f(P, 5), 
s)} = g(Q, 5). 


Let the function 0(P, #, t’), depending upon the fixed parameter ?’, 
be the solution of problem (C) when F and G have at each point the 
constant values F(P, t’) and G(Q, t’), respectively, that is, 


(C’) 


O(P, t, #) +b O(P, t,t’) = A{O(P, t, + F(P, 
(D) 4, = GQ, 
O(P, 0, ¢’) = H(P), ap, 0, #’) = I(P). 
If 6(P, s) is the iterated transform of @(P, t, t’), it follows from (D) 


that s@ satisfies the equations (C’). Therefore, ¢(P, s) =s6(P, s) and, 
according to (2), we have the formula 


(9) f )d 
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Repeating the procedure of the preceding section, it is seen that 
t 
O(P, t, t’) = O,(P, t’) + 02(P, t, + f 0,(P, 7, t’)dr, 
0 


where ©,, 02, and 9; are the solutions of the following problems, re- 
spectively, 


(d) ’)} =0, 24010, ’)} =GQ@, £); 
0.(P ") = A{0.(P, #, 
Or o( + at (P,t,¢)=A 2( 
(e) ’)} =0, O2(P, 0, = H(P) — 7), 


— 0.(P, 0, t’) = I(P); 
ot 


3? 
— 0;(P, t, b— 03(P, = Aj}O3(P, t, 
(f) =0, O3(P, 0, =0, 
— @,(P, 0, ’) = F(P, 
a( ) (P, 
Therefore, in view of (9), 


(10) t 
a( ) 


Thus the solution of the general problem (C) is resolved into the solution 
of the two basic types (d), and (e). Problem (f) is a special case of (e). 
The latter can, of course, be still further resolved into two simpler 
problems, one being of the same form as (f) with the second initial 
condition nonhomogeneous, and the other having the nonhomogene- 
ity occurring in the first initial condition. 
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ROOTS OF CERTAIN CLASSES OF POLYNOMIALS 
LOUIS WEISNER 


It is well known! that if the roots of the polynomials ¢(z) and F(z) 
are real, so are the roots of the polynomial ¢(D) F(z), where D=d/dz. 
This result has been applied to certain types of entire functions and 
trigonometric integrals.? The following example illustrates the method 
employed. If 


(1) fe) = 
k=0 
is a polynomial whose roots Ai, - - - , A, lie on the unit circle, then the 


roots of the polynomials 


F,(2) = cal] [(1 + 2/p)? — (1 — 2/p)?], p= 1, 
k=l 
lie on the axis of pure imaginaries. Therefore, if the roots of the poly- 
nomial ¢(z) lie on the axis of pure imaginaries, so do the roots of the 
polynomials* ¢(D) F,(z), p=1, 2, - - - . Now the sequence of polyno- 
mials { F,(z)} converges uniformly in every finite region to the func- 
tion 


F(z) = e~™* f(e?*) = > 


k=0 


and the sequence {¢(D)F,(z)} converges likewise to 


¢(D)F(z) = — 


k=0 


The roots of ¢(D) F(z) therefore lie on the axis of pure imaginaries. 
Removing the innocuous factor e~"‘, and replacing e”* by 2, the fol- 
lowing theorem results: If the roots of f(z) lie on the unit circle, and the 
roots of o(z) lie on the axis of pure imaginaries, then the roots of the 
polynomial 


Presented to the Society, September 5, 1941; received by the editors June 6, 1941. 

1 Ch. Hermite, Nouvelles Annales de Mathématiques, vol. 5 (1866), p. 479; Pélya 
and Szegé, Aufgaben und Lehrsdtze aus der Analysis II, p. 47, Problem 62. 

2 See G. Pélya, Uber trigonometrische Integrale mit nur reellen Nullstellen, Journal 
fiir die reine und angewandte Mathematik, vol. 158 (1927), pp. 6-18. 

3 Replacing z by iz it follows from the above theorem of Hermite that if the roots 
of ¢(z) and F(z) lie on the axis of pure imaginaries, so do the roots of ¢(D) F(z). 
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(2) g(z) = cp(2k — n)z* 
k=0 
lie on the unit circle. 

This theorem provides us with a large class of polynomials whose 
coefficients are explicitly given and whose roots lie on the unit circle. 
As the theorem itself concerns polynomials, while the proof is trans- 
cendental, an elementary proof is desirable. An elementary proof, 
with extensions, follows. 

From the polynomials f(z) and ¢(z), where f(z) has the explicit form 
(1), construct the polynomial g(z) defined by (2). Setting 


¥(2) = — n), 


we have 

g(a) 

k=0 

If a1, - - * , @m are the roots of (z), so that 

¥(z) = (z — ax), 

k=1 

then 
(3) g(2) = a] (2D — = 


k=1 


Now suppose that the roots of ¢(z) lie on the axis of pure imagi- 
naries and the roots of f(z) in the ring | z| <r2, where 712. 
From the relation between ¢(z) and ¥(z) it is evident that the roots 
of ¥(z) lie on the line Rz=n/2. Let a be a root of ¥(z) and ¢ a root of 
the equation 


(4) af'(z) — af(z) = 0. 


If ¢ is also a root of f’(z), £ is a multiple root of f(z) and hence lies in 
the ring r1<|z| <re. In the contrary case the centroid of f(z) relative 
to ¢ is different from ¢. This centroid is‘ =(1—n/a)f, 
by (4). From Ra = n/2, we infer that | 1 —n/a| =1;therefore = 

If ¢ is outside the ring 11S | z| <r2, the circle with center at the origin 
which passes through ¢ and ¢; fails to separate the roots of f(z), in 
violation of a theorem of Laguerre.' Therefore all the roots of (4) 


* E. Laguerre, Oeuvres, vol. 1, p. 56. 
5 Laguerre, loc. cit., p. 57, p. 134; Pélya and Szegé, loc. cit., p. 57, Problem 106. 
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lie in the ring r1S | z| <r.. Applying this result repeatedly it follows 
from (3) that the roots of g(z) lie in the ring. 


THEOREM 1. If the roots of $(z) lie on the axis of pure imaginaries, 
and the roots of f(z) in the ring n= | z| <re, then the roots of g(z) lie in 
this ring. 


Coro.iary 1. If the roots of o(z) lie on the axis of pure imaginaries, 
and the roots of f(z) on the circle | z| =r, then the roots of g(z) lie on this 
circle. 


CoroLuary 2. If the roots of o(z) lie on the axis of pure imaginaries, 
the roots of the polynomial 


o(2k — n)z* 


k=0 
lie on the unit circle. 


To prove the first corollary, take r,=r2=r. To prove the second, 
take f(z) =1+z2+ --- +2". 

We suppose now that the roots of f(z) lie in the circular region 
| 2| <r, and that the roots of $(z) lie in the half-plane Rz<0. We 
reéxamine (4), retaining our previous notation. If a=0, the roots of 
(4) lie in the region | z| <r by the Gauss-Lucas theorem; hence we 
need only consider the case a~0. We may also suppose that ¢ is not 
a root of f’(z). 

We now have Ra<n/2; hence |1—n/a| 21, and |f,| =|¢|. If 
\¢ | >r, then \¢ 1| >r, and a circle may be drawn through ¢ and §; 
which fails to separate the roots of f(z). As this result contradicts 
Laguerre’s theorem, the region | z| <r includes all the roots of (4) 
and consequently all the roots of g(z). 


THEOREM 2. If the roots of o(z) lie in the half-plane Rz<0, and the 
roots of f(z) lie in the circular region | z| <r, then the roots of g(z) le 
in this circular region. 


Theorem 2 may be extended to the case in which ¢(z) is the limit 
of a sequence of polynomials whose roots lie in the half-plane Rz<0 
and which converges uniformly in every finite region.’ Likewise 
Theorem 1 and its corollaries are valid if @(z) is the limit of a se- 


6 Entire functions which are limits of such sequences of polynomials are charac- 
terized by Eduard Benz, Uber lineare, verschiebungstreue Funktionaloperati und 
die Nullstellen von ganzen Funktionen, Commentarii Mathematici Helvetici, vol. 7 
(1935), p. 246. 
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quence of polynomials whose roots lie on the axis of pure imaginaries 
and which converges uniformly in every finite region. 


HuNTER COLLEGE 


GENERALIZED LAPLACE INTEGRALS 
R. P. BOAS, JR. 


We consider the linear space $(c) whose elements are functions f(z) 
[z=x+iy] which are analytic for x >c and satisfy 


(1) + in) Pay 
where the finite number M depends on the function in question. It 
is well known that an element f(z) of S(c) has boundary values 
f(e+ty) almost everywhere on x=c, and that §(c) is a Hilbert space 
if the norm of f(z) is defined by 


= + ay) bay. 


Furthermore, it is known [5, p. 8] that if f(z)€G(c), then f(z) is 
representable as a Laplace integral for x>c, in the sense that there 
is a unique function! ¢(¢) with e~*¢(t) EL2(0, ©) such that 


(2) lim 


T 
- f = 0; 
0 
we shall express (2) by writing 
(3) f(z) -f x> ¢. 
0 


It is easily verified that the integral in (3) converges in the ordinary 
sense for x >c. A Laplace integral may be regarded as a generalized 
power series; the object of this note is to generalize the integral repre- 
sentation (3) by replacing e~*‘ by a kernel g(z, ¢) which is in some sense 
“nearly” e~*‘, just as power series ).a,2" have been generalized? by 
replacing the functions 2" by functions g,(z). 


Presented to the Society, September 5, 1941; received by the editors May 24, 1941. 
1 Unique, that is, up to sets of measure zero. 
2 For a bibliography of this problem, see [1]. 
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We suppose that g(z, ¢) is, for each fixed ¢ in 0<t< ©, an analytic 
function of z in x >c; and that for each fixed z in x >c, g(z, t) EL(0, R) 
for every R>0. 


THEOREM 1. If for each positive T 


T 
(4) € T) implies f g(z, H(c); 
0 


f i a(t)e~**dt | 


for all S>R>0 and all functions a(t)EL*(R, S), then there exists for 
each f(z) EH(c) a unique Y(t) with e4p(t) EL2(0, ©) such that 


and tf there is a number X, 0<d <1, such that 


(5) 


f “all [e-** — g(z, 
R 


Ex 


(6) se) = f als, 
0 


where the integral is a mean-square limit for x =>c, and also converges in 
the ordinary sense for x >c. 


In Theorems 2, 3 and 5 we shall replace the conditions of Theorem 1 
by more convenient conditions; in Theorem 6 the theory will be ap- 
plied to the generalized Laplace integrals recently discussed by Meijer 
[4] and Greenwood [7], namely . 


(=)" 


where K,(z) is the usual notation for a Bessel function of imaginary 
argument [6, p. 78], and —}<(v) <3 (if v= +3, we have (3) again). 
Theorem 1 is easily proved by the method of successive approxi- 
mations used for a similar purpose by Paley and Wiener [5, p. 100]. 
Let f(z) €H(c). Then there is a function ¢(¢) with e~*o(¢) ECL2(0, 
such that 


here, and throughout the proof, integrals over (0, ©) are taken as 
mean-square limits, as in (2). Then the integral 


fa. 
0 
exists; for, by (5), 
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s 
f o(t)e~**dt 
R 
as Rand S-~o. 


We now define inductively sequences { ¢n(t)} and { fa(z)} by set- 
ting go(t) 


| — g(z, < 


and generally 


a) = f 


= f — g(z, Jat. 


Thus 
0 k=0 
and by (5) 
forall = al fl 
(7) 
Consequently 
| | 
Now 
and hence 


| > < 0, m,n— ©. 
i” 0 m 


Therefore, since $(c) is complete, there exists F(z)€(c) such that 


0 k=0 


— 0, n—o, 


and 


F(z) = f eH (t) € L*(0, 
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If we set 


(10) val) = — 
k=0 


we then have 


f gle, 


then by (5) 


By (10) and (8) we thus have 


(iat | — 0. 


and the existence of the representation (6) is established. That (6) 
converges in the ordinary sense for x >c follows easily from the fact 
that functions of §(c) are represented by their Cauchy integrals [2, p. 
338]. 

To show that the representation is unique, we have only to show 
that 


implies w(t) =0 almost everywhere. Now if (11) is true, for every posi- 


tive T we have, by (5), 
T 
f w(t) — g(z, 2) ]dt f w(é)g(z, pat|| 
0 0 


The limit of the last term on the right is zero, and since \<1 it fol- 
lows that 


7 | 


=, 


f w(t)e—*'dt = 0, 
0 
Since the Laplace representation (3) is unique, w(¢) =0 almost every- 
where. 

THEOREM 2. If forx>c 
(12) g(z, = e*{1+ h(z, d)} 
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with 
h(z, t) = f e~*“w(t, u)du, 
| w(t, | = w(x), 
and 
(13) h(c) < 1, 


then the function g(z, t) has properties (4) and (5), and consequently the 
representation (6) is possible and unique. 


If L?(R, S), we set a*(¢) =a(t) in (R, S) and a*(t) =0 outside 
(R, S), and use the Parseval theorem for Laplace transforms; with 
z=c-+y, 


s 2 
f e~*ta(t)h(z, | dy 
R i 


2 


1 
= a*(t)w(t, » — d 
1 
< — | a*(t) | w(v — 
2rJ 0 


e~**dv 


| a*(t) | w(v — | dy 


2 
a*(2) | at f e~**w(u)du | d 
0 0 


2 
e~##| a*(Z) | a| dy 


é jo | a*(¢) | de lay 


y 


*dy. 


- 
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Since h(c) <1, this gives us (5). But it also implies 
2 1/2 
J 
2 1/2 
< {1+ { f f | ay\ 
0 


{1 + h(c)} { | og 


f iat 


and hence (4). 
For example, (13) is satisfied, with c=1, if h(z, t)=2/(z?+4#?) or 
1/(z+2). 


THEOREM 3. If h(z, t) is defined by (12) and satisfies 
(14) | A(z, #)| <| | 10), 


with (2)|| | =p then g(z, t) has properties (4) 
and (5) and the representation (6) is possible. 


In fact, we have 


8 
| f a(t)e-**h(z, | f | a(t) | dt, 
R R 


| J | 

< { | 10) * a(t) ah ue 
. 


s 
f a(t)e~*'dt 
R 


This verifies (5); and (4) follows because 


S 


| = | + h(z, 


+ |] oa 


. 


< [1+ 
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THEOREM 4. If g(z, t) =g*(z, t) for t2s>0; if the representation (6) 
is possible and unique for x >c*, with g(z, t), for every f(z) of S(c*); and 
possible and unique for x >c, with g*(z, t), for every f(z) of H(c) (where 
c <c*), then the representation is also possible and unique for x >c, with 


g(t, 2), for every f(z) of H(c). 


We have 


(15) f(z) = f g(z, 4 > 


Define 
Fe) = se) fate, = fete, f OW 


The last integral defines F(z) for x >c; and since the representation 
of F(z) in terms of g*(z, ¢) is unique for x >c, the integral 


fate 


must represent F(z) for x >c; hence (15) represents f(z) for x >c also. 


THEOREM 5. Theorem 3 remains true if (14) is satisfied with 


k(z)EH(c) and ~). 
If k(z)€H(c) we have 
(16) lim "| k(x + iy) = 0; 


for, if x>c, 


k(x + iy) = f dt, € L2(0, ©); 


and so 


1 
T 0 


Choose s so large that 


1/2 
(17) ( f | at) =p < 


Then define h*(z, t)=h(z, t) for t>s, h*(z, t)=0 for OStSs; and 


| 
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g*(z, t) =e-**[1+h*(z, t)]. The function g*(z, t) satisfies the condi- 
tions of Theorem 3, with /(t) replaced by the function /*(#) equal to 
I(t) in (s, ©) and to zero in (0, s). On the other hand, if b is so large 


that 
1/2 1/2 


(such a b exists because of (16)), the function g(z, ¢) satisfies the con- 
ditions of Theorem 3 with c replaced by b. Thus the representation (6) 
is possible and unique with g(z, ¢) for x >b, and with g*(z, t) for x>c. 
Theorem 5 now follows from Theorem 4. 

Let ao have its usual meaning in the theory of Bessel functions 
[6, p. 78]. 


THEOREM 6.3 If —}<R(v) <}, then for every f(z)EH(0) there is a 
unique ©) such that 


(18) fle) = (=)" 


where the integral is a mean-square limit, as in (2), for x20, and con- 
verges in the ordinary sense for x>0. 


We shall show that the function (2zt/:)'/?K,(zt) satisfies the hy- 
potheses of Theorem 5, with c=0. The following inequalities for K,(z) 
are valid for R(z) >0 [6, p. 219; 3, p. 658]. 


(19) = Ro(z), 
(20) = 1 + R,(z)/(2z2), 
where 
COS vr 
| Ri(z)| < |’ j= 0,1. 


If now h(z, t) is defined by 
= e-**[1 + h(z, A], 
from (20) and (19) we have, with A(v) depending only on », 
| A(z, t)| < r>0,0<t<o. 


* A closely related theorem is given by Meijer [4, p. 603]. Meijer also obtains an 
inversion formula for (18). 
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Hence, with some B(v), 
B(v) 
z|(1+2) 


and the conditions of Theorem 5 are satisfied. 


| 


» £>0,0<t<a, 
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DvuKE UNIVERSITY 


CLASSES OF MAXIMUM NUMBERS ASSOCIATED WITH 
TWO SYMMETRIC EQUATIONS 


H. A. SIMMONS 


1. Introduction. Let >>; (1/x) stand for the elementary symmetric 
function of the jth order of the z reciprocals (1/x,) (p=1, 2, - - -,i>0) 
with 


> (1/x) =0 when i<jorj<0O, 


1 when j=0 


(>:.(x) having a similar meaning for the x, themselves). 

Here we extend the work of papers I,! II,? III* by obtaining relative 
to equations (1) and (1.1) below results analogous to those in I, II, 
III 


(1)! (1/2) + = 


a=(c+1)b—1, x(x) = Xn, 


(1.1) (1/x) +4 (1/2) +u DY (1/x) = b/a; 


n,n—2 


in (1), b, c, and m are arbitrary positive integers, n>1, and the a; 
are any non-negative real numbers; in (1.1), a and 8 are as in (1), 
n>2, is a non-negative integer, and yp is a positive integer. 

We have not seen previous mention of (1); the case of (1.1) in 
which p =0 was treated in II and that in which A=p=1 was treated 
in III. Our procedure for (1) does not suffice for the equation that is 
obtained by adding to the left member of (1.1) the terms 


> a;[x(x) 


The following definitions and notation from I will be frequently 


Presented to the Society, September 5, 1941; received by the editors June 27, 1941. 

1H. A. Simmons, Transactions of this Society, vol. 34 (1932), pp. 876-907. 

2 Norma Stelford and H. A. Simmons, this Bulletin, vol. 40 (1934), pp. 884-894. 

3H. A. Simmons and W. E. Block, Duke Mathematical Journal, vol. 2 (1936), 
pp. 317-340. 

4 In so far as we know, the form of the right member of equation (1) was first used 
by Tanzo Takenouchi in the Proceedings of the Physico-mathematical Society of 
Japan, (3), vol. 3 (1921), pp. 78-92. 
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used here: x1...), 13pm, stands for the set (x, x2,---, Xp); 
P(x)=P(x1, x2,--+, X,) stands for a polynomial, not a constant, 
which is symmetric in the x; (¢=1, 2, ---, m) and has at least one 


positive, and no negative, coefficient; the Kellogg solution of equation 
(e), where e stands for (1) or (1.1), is the solution that is obtained by 
minimizing the variables x, x2, - - - , X,-1 in this order, one at a time, 
in (e) among positive integers; an E-solution of (e) is any solution of 
it in which x:S%2S --- Sx, while x1, x2,---, x, are positive in- 
tegers. When any further definition or notation from I, II, or III is 
used here, a suitable reference to the appropriate article will be given. 

We can now state accurately our purpose here. It is to prove that 
the Kellogg solution w of equation (e) has the following two proper- 
ties, which were called remarkable properties in III: (i) It contains the 
largest number that exists in any E-solution of (e) and no other 
E-solution of (e) contains this number. (ii) If X, with X#w, is an 
E-solution of (e), then P(X) < P(w). 

The discussion from §2 to the end of this paper is divided into two 
parts as follows: Part 1 treats (1), §§2 to 6 (inclusive); Part 2, (1.1), 
§§7 to 12. 

This paper involves innovations of notation and procedure of I, II, 
III. The terms set o and set r, which were important in I, II, III, are 
not used here; they are not needed because of our use of a new term 
that is very convenient for present purposes, namely s-set (cf. §4). 
This change is accompanied by new procedure for both (1) and (1.1): 
in Part 1, we use a new lemma, namely Lemma 4.1; in Part 2, we in- 
troduce an upper bound R(X) (cf. §8) for the maximum number that 
we seek to identify and we show that R(X) is uniquely maximized, 
with respect to values that R(x) can assume on E-solutions X of (1.1), 
by the Kellogg solution of (1.1). In so far as we know, our reasoning 
about R(X) in §11 affords the first strong resemblance of our proce- 
dure (for identifying maximum numbers) to that which Curtiss® used 
in solving Kellogg’s problem.® 


ParT 1. THE REMARKABLE PROPERTIES OF THE KELLOGG 
SOLUTION OF (1) 
2. The Kellogg solution of (1). This solution is x = w where [I, (23) ] 


(2) Wp=1 (p=1,2,---,m— 2), 


5D. R. Curtiss, American Mathematical Monthly, vol. 29 (1922), pp. 380-387, 
and note, in particular, his upper bound (10), p. 384. 
®O. D. Kellogg, ibid., vol. 28 (1921), pp. 300-303. 


1942] TWO SYMMETRIC EQUATIONS 297 


with w, defined to be the unique positive solution x, of the equation 
that is obtained by substituting in (1) for each x, (p=1, 2, - - -,m—1) 
its value w, from (2). 

If »=2, only the last equation in (2) is to be retained. 


3. Our transformation. In considering an E-solution X #w of (1), 
we classify and transform elements as we did in §§15, 17 of I. Thus 
we define our transformation of X (X1...,) into anew set X’ by (#1) 
or (#2) [I, (33) and (52) ]: 


(1): Xp = 91.193 Sm), Xq = Wey Q(1/X') = OM/X); 
(te): x; = X,(p 71,19; p Nn), Xia = Wi Q(1/X’) = Q(1/X); 


according as (t:) requires X;, to be not greater than w,, or greater 
than w,,, respectively, where Q(1/X) is the case x=X of the left 
member of (1); if (t:) defines X{, to be equal to w,¢, (t:) and (t2) are 
the same transformation. 

If X’w, our transformation from X’ to X”’ is obtained from (3) 
by replacing in (3) X, X’, g by X’, X’’, q’, respectively, where 
Xy (Xie) is of class A’(B’), and the new transformation is regarded 
as a transformation (3). Thus we avoid giving here an analogue of 
(52) of I. 

Replacement of (1) by (1.1) in this section gives our transformation 
for §§11, 12; it will be called (3a). 


4, Important lemmas; s-set. In the sequel, we use Lemma 4 and 
Lemma 4.1 below. Lemma 4.1 depends on Lemma 4, which is essen- 
tially Lemma 1a of I. 


(3) 


Lema 4. Let Q(1/x) stand for a symmetric polynomial in the n re- 
ciprocals (1/xp) (p=1, 2,- ,m>1) which ts not a mere constant and 
contains at least one positive, and no negative, coefficient; with 1 and j 
equal to distinct positive integers each less than or equal to n, let x;, xj, 
a, B be positive numbers with a<x;Sx;; and suppose that the expression 
that is obtained by replacing in Q(1/x) the numbers x:, x; by (xi—a), 
(xj;+8), respectively, equals Q(1/x), then 


(4) xx; S (x; — a)(x; + 8), + < (4% — a)" + (x5 + 8)", 


where h is a positive integer. Furthermore, the equality sign holds in (4) 
if, and only if, Q(1/x) is a polynomial in [x(x) ]-. 


In the proof of Lemma 4.1 and in §§6, 12, we use the following 
definition. 
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DEFINITION OF s-set. If in a set X“ #w every element of class B“ 
[I, p. 898] is at least as large as every element of class A, we call X 
an s-set (relative to w), s meaning satisfactory in the sense that X™ can 
be transformed into w by one or more transformations of type (3) every 
one of which accords with (4). 


Lemna 4.1. Let k bean integer greater than or equal to1;leeW=W,...» 
(v>1) be the Kellogg solution of the equation (x:x2 - - - x»)~'=k~!; and 


let X=X,..., be a set of v positive integers with X25 --- SX, 
satisfying the relation 

(4.1) x) 1S 

then 

(4.2) > (1/X) s (1/W), 1<s<p/ 


with < holding in (4.2) except when X = W. 
Proor. We first consider the case where 
(4.3) +X, = 


Then X;..., is an s-set (relative to W). Therefore P(X)=P(W), 
[I, Lemma 3], the equality sign holding in this relation if, and only if, 
X = W or P(X) is a polynomial in the product of all of the v variables 


Xi, X2,---, In particular, then, when P(x) =) we have 
(4.4) (W), 1<r<y, 


the equality sign holding in (4.4) if, and only if, X = W since r<v. 
But, using (4.3), we find 


> (Xx) > (xX) D> (W) 


while, by (4.4), 
(4.6) DM), 1<v—r<v. 


From (4.5) and (4.6), it follows that in the present case (4.2) holds. 
Suppose now that < holds in the case x =X of (4.1), say 


(4.7) (XiX2--- = (k’), 


7 The case s=v is excluded merely for convenience in our applications of Lemma 
4.1; the case k=1 is included for convenience in writing, not for use. 


v.8 
| 
v,U-T 
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where k’ is an integer greater than k. By considering the Kellogg solu- 
tion of (4.7), one can prove that (4.2) still holds: indeed if the Kellogg 
solution of (4.7) is U, one finds readily that 


5. Proof of Property (i) for the w of (1). We substitute any E-solu- 
tion X ~w of (1) for x in (1) and employ the following equivalent of 
the resulting equation (_;,; standing for >°;,;(1/X) here as in the 
sequel) 

2 ta ) + ( )=5/a. 
n—1,n—1 n—1,n—2 n—1,n—1 n—1,n—1 

In order to establish Property (i), it suffices to prove that w has 
Property (i) when m=2 and to prove that the following relations hold 

n—1,n—1 n—1,n—1 n—1,n—2 n—1,n—2 


When »=2, equation (1) reduces to 
(5.2) Xi (t+ (0:X1)X: 
i=2 


In this case, X ~w implies that X,<w. This fact and (5.2) imply that 
X2<w» [I, Lemma 2]. 

When n>2, the first relation of (5.1) is true by the definition of 
Kellogg solution since (cf. (2)) 

(1/w), 
n—1,n—1 n—1,n—1 

and since X ~w, the second relation of (5.1) follows from Lemma 4.1 
(cf. the case of (4.2) in which X #w and (v, s)=(n"—1, m—2) with 
n>2). 


6. Proof that the w of (1) has Property (ii). Let X #w be an E-solu- 
tion of (1). The discussion of the case m=2 in §5 shows that in this 
case X is an s-set (so that P(X) <P(w)). Suppose n >2. Then, by §5, 
Xn <Wn,, and by (2) every classified element of X1.. .(n—2) is of class A. 
Therefore, whether X,_1 is of class A, class B, or unclassified (=wp,-_1), 
X is an s-set. 


Part 2. THE REMARKABLE PROPERTIES OF THE KELLOGG 
SOLUTION OF (1.1) 


7. The Kellogg solution of (1.1). This solution is x=w where 
[I, (23)] 


0,8 
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w, = 1, = 1,2 ,n — 3, 
(7) 
n—1,2 n—1,1 


If n=3, the first set of equations in (7) is, of course, to be omitted. 


8. An upper bound for X,. In the sequel X stands for an E-solu- 
tion of (1.1), arbitrary except as we specify. 

If we substitute X for x in (1.1) and solve the resulting Scien 
for X,, we find after simple algebraic manipulations that 


(xy+ra (x) +4] 


n—1,2 n—1,1 
(8) 
n—1,1 
Since the X, (p=1, 2, - - - , m—1) are positive integers, the second 


factor in the right member of (8) is the reciprocal of a positive integer. 
Therefore, 


(8.1) > (x) +4], 


n—1,2 n—1,1 


9. An inequality for X,_, when X, is the maximum number. In 
the sequel, the statement that X ~w and X,, is the maximum number 
that we seek to identify (so that X, 2w,) will be referred to as hypothe- 
sis H, or merely as H. We use H henceforth until a contradiction of 
it is reached in §11. 

Under hypothesis H, we now desire to prove that 


(9) 

Suppose that this is not true, so that (with H holding) 

(9.1) Xn-1 > Wa-1. 

We presently contradict (9.1). The case x = X of (1.1) is equivalent to 


n—2,n—2 n—2,n—3 n—2,n—2 


(9.2) 


+ (1/X,)( +02 +02 ) = 


n—1,n—3 n—1,n—2 n—1,n— 


a=(c+1)d—1, 
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with m >2. To reach the contradiction, we first establish the following 
relations 


(9.3) > = d (1/), n> 2, 


n—2,n—2 n—2,n—2 


(9.4) s DY (t/w +a (i/w), n>2. 
n—2,n—3 n—2,n—2 n—2,n—3 n—2,n—2 

Since X isan E-solution not equal towof (1.1) anda >2, 

2c+1=wiw2--- therefore, (9.3) is true. Consequently, to 

prove (9.4), it suffices to show that 


(9.5) > (1/»), n> 2. 
n—2,n—3 n—2,n—3 
When =3, (9.5) states that 1=1; when (9.5) is a case of Lemma 
4.1 in which X ¥ W and (v, s) =(n—2, n—3) with n—3 21; therefore, 
(9.5) is true. 
Next, using (9.1), (9.3), and (9.4), we find that the sum of the terms 
in the first line of (9.2) is less than U, where 


n—2,n—2 n—2,n—3 n—2,n—2 

Indeed, if in the first line of (9.2) we should replace X,_1 by X,-1—1, 

the resulting expression would not exceed U. Consequently, there ex- 

ists for (1.1) an E-solution Y in which 


(9.6) Y,= Xp (p = 1,2,---,"— 2), Y,-1 = Xn-1 — 1, Y, > Xa, 


and the inequality in (9.6) contradicts H. Hence, under hypothesis H, 
(9.1) is false. 


10. On the classification of the elements of X;...(n.-1) when H 
holds. For use in §11, the following statement, S, will presently be 
proved: In X1...(n—1) every element of class B is at least as large as 
every element of class A. 

To avoid vacuous language in the proof of S,* we consider sepa- 
rately the cases n=3 and 

Case n=3. Here Xi... (n—-1) =(Xi, X2), and either X,2w, (p=1, 2) 
or X1>wi, X2<w2; in both cases S is true. 

Case n>3. Here, by (7), any classified element of X1...(n-s) is of 
class A; X,_2 is of class A, class B, or unclassified (=w,_1); and by 
(9) X,_-1 is either unclassified or of class B. Therefore S is true. 


8 If n=3, it is vacuous to say that any classified element of X1...(n—3) is of class A 
(cf. our discussion of the case n >3). 
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11. Proof of Property (i) for the w of (1.1). If X1... 1) contains 
noelement of class B, X #wimplies that X, <w,, which contradicts H. 

Suppose that X1...-1) contains at least one element of class B 
and, therefore, at least one element of class A, since the first classified 
element of X is necessarily of class A. Then, by S, every application 
of transformation (3a) to X or to an intermediate set of X [I, p. 898], 
which does not change the magnitude of the mth element of a set, 
accords with (4). Further, the last such transformation in the ex- 
haustive set for X [I, p. 898] yields a set X in which XY =X,2w, 
(cf. H) and 
(11) Su, =p =1,2,---, 
otherwise X“ would be a set satisfying (1.1) and having at least one 
element of class A“ and no element of class B“, which is impossible. 
We reach a contradiction of H as follows. Let R(X) stand for the right 
member of (8.1), so that X, = R(X). Certainly R(X) is expressible in 
the form 


R(X) =F + G(Xo, + 


in which F, G, H are positive and independent of X,, and X,,, while 
X4,<X,¢ [I, p. 898]; therefore, the first transformation (3a) that one 
uses in passing from X to X is such that R(X) < R(X’) (cf. (4)). 
If ¢>1, our transformation of X’ into X”’ is such that R(X’) <R(X”’), 
and so on. On arriving at X“, one has 


(11.1) Xn S R(X) < R(X’) S R(X), #21. 


But by (11) and the fact that R(X) depends only on the first n—1 
elements of X, we have 


(11.2) R(X) < R(w). 


By (11.1) and (11.2), X¥,<R(w); and since R(w)=w, (cf. (7)), 
X,n<W,. This contradicts H. 


12. Proof that the w of (1.1) has Property (ii). H negated, we now 
merely suppose that X#w. We again avoid vacuous language by 
treating separately the cases n=3 and n>3. 

If n=3, X3<ws (cf. §11) and either X,=w, (p=1, 2) or X1>wi, 
X2<w2; in either case X is an s-set (and P(X) <P(w)). 

If n>3, X, is of class B, and every classified element of X;. . . (n—) 
is of class A (cf. (7)). Then if one of X,_2, Xn-1 is unclassified, X is 
an s-set; the same is true if X,_2, X,_1 are of the same class or if 
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Xn-2 (Xn-1) is of class A (B). Therefore, we only need to consider the 
case in which X,_2 (X,n_1) is of class B (A). Then 


and X,...(,-2) contains one or more elements of class A (preceding 
the element X,_2, of class B). Apply transformation (3a) to X, or to X 
and one or more intermediate sets of X, until a set X “ is obtained in 
which X“...(,-2) does not contain both an element of class A “ and an 
element of class B“. Since each transformation that has been applied 
to this point has increased the (m—2)d element of a set and decreased 
a not larger element (with subscript less than  —2) each transforma- 
tion applied has accorded with (4), so that necessarily 


(cf. the first two lines below (4)), and 


xs > w,, p=1,2,---,n—2, 
with > holding for at least one of the indicated values of p (cf. (12)). 
Further, by hypothesis X,1>w,-1, and by §11 X,<w,, while no 
transformation used in arriving at X“ has changed the value of the 
(n—1)th or nth element of a set. Therefore, 
with > holding in (12.1) for at least one of the indicated values of p. 
If u is a value of » for which > holds in (12.1), then X® < X, since 
transformation (3a) never increases the value of an element of class 
A“), Consequently, the classified elements of X“, like the elements 
of X, do not decrease as their subscripts increase, and so the X“ of 
(12.1) is an s-set. 


NORTHWESTERN UNIVERSITY 


ON A FAMILY OF FOURIER TRANSFORMS 
AUREL WINTNER 


A procedure leading to specific properties of the symmetric stable 
distribution functions results by subjecting certain generalizations of 
the Bessel functions J,(¢) to a limit process.! There arises the question 
as to the existence of the corresponding distributions in case the limit 
process involved is omitted. The object of this note is to delimit the 
conditions under which the answer is affirmative. It turns out that the 
situation is quite different from that resulting in the limiting case of 
stability. 

Let L(t; @) denote the Fourier transform, 


Lit; $) -f e*'=d(x), <t<o, 


of a distribution function ¢=¢(x), — © <x< ©, that is, of a mono- 
tone function satisfying the boundary conditions ¢(— ©)=0 and 
o( ©) =1. If ’ denotes the derivative of @ (a derivative which neces- 
sarily exists and is finite almost everywhere), the Stieltjes integral 
L(t; ¢) reduces to 


L(t; ¢) = f 
if and only if ¢ is absolutely continuous; in which case $’(x) is called 
the density of ¢(x). 

For a real or complex number 2, let 2; denote z or 0 according as z 
is or is not positive. Thus, if A>0O and w>O0, the even function 
L=(1i—|t|»)4 of the real variable ¢ represents a continuous curve in 
a (t, L)-plane, this symmetric curve being situated above or on the 
t-axis according as |t| <1 or 1<|t| < ©. It should be noted for later 
reference that the curve has at t=0 a cusp with a tangent perpendicu- 
lar to the t-axis, a corner with two distinct finite slopes or a tangent 
parallel to the t-axis, according as A\<1, A=1 or A>1, while y is arbi- 
trary. 

If X\=1=y, the function (1—|#|*)4 is 1—|t] or 0 according as 
|t]<1 or || 21, and represents therefore the Fourier transform 
L(t; ¢) of an absolutely continuous distribution function $(x). This is 


Received by the editors July 10, 1941 
1 A. Wintner, On a class of Fourier transforms, American Journal of Mathematics, 
vol. 58 (1936), pp. 45-90. 
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seen either by subjecting the kernel of Dirichlet’s factor of discon- 
tinuity to the process of auto-convolution or by applying a Fourier 
inversion. In fact, 


cos tx or dx = 
0 if 15 |t| < @; 


lA 


so that the density $’(x) is the square ( )? on the left. 


Thus, if (A, w) =(1, 1), there exists a distribution function ¢y, =¢y,(x) 
such that 


if 1, 


There arises the question for what additional pairs (A, u) does (1) 
determine a distribution function g¢,. Since the derivative 


(2) = (sin x/2)*/ (xx/2/2)2, 


though non-negative, has zeros, no continuity consideration can indi- 
cate that there exists a distribution function ¢), for certain pairs (A, ») 
sufficiently close to (1, 1); in fact, the derivative ¢,/ (x) might then 
become negative, precluding a distribution function ¢,(x). 

Since the function (1— | ¢| )4, of t is integrable over (— ©, ©), it is 
clear by Fourier inversion that, if there exists at all a distribution 
function gy, satisfying (1), then ¢, is absolutely continuous and has a 
density ¢,/ (x) =0 proportional to 


1 
(3) = ata 
0 
the factor of proportionality being 1/z for arbitrary (A, u). Accord- 


ingly, the question can be restated by asking, which are those pairs 
(A, #) for which the function (3) satisfies the conditions represented by 


(4) = 0, 0<2x< oa, 
and 
(5) < ©, 


If the unit of length on the x-axis is changed in the proportion 1:a, 
the definition of the Fourier transform L(t; @) shows that the unit of 
length on the ¢-axis must be changed in the proportion a:1. Hence, if 
a=yp", the function (1) must be replaced by (1 — if |t| <p, 
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and O if |t| This function of tends, as ©, to exp (— | ¢| for 
— «© <t<o. This means that the limiting case ¢).. of @, is the sym- 
metric stable distribution function of index \; a distribution function 
which is known to exist? if and only if \ does not exceed the critical 
value A=2 belonging to a Gaussian distribution (the condition 
Saw(x) 20 corresponding to (4) being violated for certain values of x 
if and only if A>2). 

It will turn out that the characteristic condition \ <2 belonging to 
y= must be replaced by A $1 for every finite » 21. In other words, 
if u is arbitrarily fixed on the range 1 <y< ~, the distribution func- 
tion ¢, does or does not exist according as \ <1 or A>1. Incidentally, 
the trivial case (2) happens to be a limiting case, in the sense that (4), 
where \ <1 Sy, can be refined to 


(6) fa(x) > 0, 0<x< 


unless (A, uw) =(1, 1). 

It seems to be of interest that there exists a fixed lower bound 
(equal to 1) for the admissible values of wu in the whole admissible 
range of X (<1). That uw must be limited by some lower bound for 
every given admissible X, is clear from Lévy’s theory of indefinitely 
divisible laws. In fact, if X is arbitrarily fixed, (1) requires that 
dx.) be the wth power of L(t; for every yu, although L(t; 
vanishes for certain values of ¢t. But this contradicts the stochastic 
theory just mentioned. 

Actually, it is easy to see directly that there exists for every \>0 
a sufficiently small 4 >0 such that the corresponding function (3) be- 
comes negative for certain values of x and violates, therefore, (4). In 
fact, it is clear for reasons of continuity that there must exist for every 
X\>0 such a sufficiently small » >0, if it is true that, for every fixed 
X>0, the function (3) becomes negative for certain values of x in the 
limiting case But if then (3) reduces for every to 


1 
fro(x) = f cos xt dt = x~' sin x, 
0 


and attains therefore negative values. 

For sake of simplicity, it will from now on be assumed that yp has 
a fixed value not less than 1, while \>0 is arbitrary. 

It will be shown that (4) is violated or the necessary and sufficient 
condition, represented by (4) and (5) together, is satisfied, according 


? For sharper results see, loc. cit., where references are given to the investigations 
of Lévy and Pélya. 
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as \>1 or AS1; that is, according as the function on the right of (1) 
is or is not differentiable at t=0. 

First, if \>2, the function on the right of (1) is 1+ 0(¢?) as t-0. 
But this implies* that either (4) is violated or L(t; g),) is independent 
of t, and the latter case is excluded by (1). Next, if \=2, then (3) 
reduces to a constant multiple of the classical integral representation 
of the ratio J,41;2(x)/x*t/?; so that (4) is violated in view of the 
asymptotic formula of the Bessel functions for large real x. But a per- 
fectly similar asymptotic formula, again with a sine function in the 
numerator, can be proved‘ for (3) even if A\<2, provided that A>1. 
Accordingly, (4) is violated® whenever A> 1. 

It will now be shown that (4) is satisfied whenever \ <1. To this 
end, it will be sufficient to prove (6), since the limiting case (A, ») 
=(1, 1) is taken care of by (2). 

Partial integration of (3) shows that 


(7) Xfru(x) = ru f (1 — A)?! sin xt dt. 
0 


But, if F,,(¢) is an abbreviation for (1—#)4-'#—", the last integral can 
be written in the form 
(ntl) 


f sin xt dt = 21 (- F,,(t/x) sin t dt 

0 n=0 na/z 

for every x >0. Hence, in order to prove (6), it is sufficient to ascertain 
that the non-negative function F,,(t/x) of t/x is steadily decreas- 
ing as long as it does not vanish; in other words, that the function 
(1—#)*-'P—! of ¢ is steadily decreasing on the interval 0<¢<1. But 
both factors (1—#)#—!, #-! are positive and non-increasing, and at 


3 Loc. cit., p. 80. 

4 Cf. A. Wintner, On the asymptotic formulae of Riemann and of Laplace, Proceed- 
ings of the National Academy of Sciences, vol. 20 (1934), pp. 57-62. 

5 Naturally, the problem is less deep than to necessitate an actual application of 
the sharp asymptotic fact, referred to before. For instance, if 1=1, a partial integra- 
tion transforms (3) into 


1 z 
Su(x) = f P-1 sin xt dt = f P-1 sin dt. 
0 6 


But this function violates (4) whenever \>1, since, if \A—1>0, the successive “waves” 


hes 


P-1 sin tdt = f P| sin ¢| dt, where n = 0, 1,2,---, 
ne 
increase with n. It is easy to transfer this primitive proof from »=1 to every »>1, 
if use is made of the second mean-value theorem. 
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least one of them is decreasing, on the interval 0<#<1. In fact, 
u—120 and A—1<0 by assumption, where at least one of the two 
signs of equality is excluded by (A, u) ¥(1, 1). 

This proves (4) for A\S1<Sy. It remains to be shown that (5) is 
satisfied for \<1<y. To this end, it is sufficient to verify that 


(8) = O(a-*), as x—> 0, 
Suppose first that \=1. Then (7) shows that (8) reduces to 


1 
f (1 — sin xt dt = O(2-), as —> ©, 
0 


But this estimate is clear from the second mean-value theorem. 
In the remaining case, where \ <1, the estimate (8) can be refined 
to the asymptotic formula 
+ A) sin (4/2) 


x 1+ 


(9) ~ » <i. 
In fact, since 0<A<1 implies the existence of the integral 


(10) fw sin ¢ dt = T(A) sin (xd/2), 


it is clear from (7) that (9) is equivalent to the assertion that, as 
x— the function 


ef (1 — sin xt dt 
tends to the limit (10). This function can be written in the form 
(1 — sin ¢ dt, 
0 
and tends therefore to the limit (10) if it is legitimate to carry out the 


process x— © beneath the last integral sign. But this is readily justified 
by the second mean-value theorem. 


THE Jouns Hopkins UNIVERSITY 


= 


EVERYWHERE DENSE SUBGROUPS OF LIE GROUPS 
P. A. SMITH 


A recent note by Montgomery and Zippin' leads one to speculate 
concerning the nature of everywhere dense proper subgroups of con- 
tinuous groups. Such subgroups can easily be constructed. Suppose 
for example that G is a non-countable continuous group which admits 
a countable subset Gp filling it densely. The group generated by Go 
is everywhere dense in G but is not identical with G. In the case of 
Lie groups, it is easy to see that an abelian G admits non-countable 
subgroups of the sort in question; whether or not a non-abelian G does 
so, appears to be a more difficult question. We shall, however, show 
that if G is simple, proper subgroups of G cannot, so to speak, fill G 
too densely. 

Let G be a simple? Lie group of dimension r with r>1, and let U 
be a canonical nucleus of G—that is, a nucleus which can be covered 
by an analytic canonical coordinate system. An arbitrary point x of U 
is contained in the central of at least one closed proper Lie subgroup of 
G with non-discrete central. In fact, through x there passes a one- 
parameter subgroup 7; the closure of y is an abelian Lie subgroup 
and this subgroup is proper since G is simple and r>1. 


THEOREM. Let G be a simple Lie group of dimension r greater than 
one and let g be a proper subgroup filling G densely. There exists at least 
one proper closed Lie subgroup H of G such that those left- (right-) cosets 
of H which fail to meet g fill G densely. For H one may take any closed 
proper Lie subgroup of G whose central 1s non-discrete and contains an 
arbitrarily chosen point pin g(\U, U being any given canonical nucleus 
of G. 


Proor. Let U, p, H be chosen and let us consider only the left- 
cosets of H. It will be sufficient to prove that there exists at least one 
coset, say aH, which fails to meet g. For, the cosets obtained by multi- 
plying aH on the left by arbitrary elements of g fail to meet g and fill 
G densely. 


Received by the editors July 15, 1941. 

1 Deane Montgomery and Leo Zippin, A theorem on the rotation group of the 2- 
sphere, this Bulletin, vo!. 46 (1940), pp. 520-521. Our theorem may be regarded as a 
generalization of the theorem of Montgomery and Zippin and the proofs of the two 
theorems may be regarded as being the same in principle. 

2 We use simple here in the sense of having a simple Lie algebra. A simple group 
need not be connected. 
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Let us assume the contrary, namely that every coset of H meets g. 
Let H* be the totality of cosets of H and let the elements of H* be 
denoted by e*=H, a*=aH,---. Let o be the mapping x—x* 
(x* =xH) of G into H*. Let H* be topologized in the usual way by 
taking as open in H* every set of the form aA where A is an open sub- 
set of G. The space H* is homogeneously locally euclidean.— Now let 
x* be an element of H* and let x be a representative of the coset x*. 
Then xpx—! (where p is defined in the theorem) is independent of x. 
For if y is a second representative of x*, then x~'yCH so that 
x—!yp = since is in the central of H. Hence xpx-!=ypy—'. Thus 
the formula r(x*) =xpx~! defines a mapping 7 of H* into G which, 
in particular, carries e* into p. Evidentally 7 is continuous. In fact 
it is easy to see that 7 is analytic relative to an arbitrarily chosen 
analytic canonical coordinate system %1, --- , x, covering U, and a 
suitably chosen coordinate system covering a neighborhood of e*. 

The mapping f carries H* into a subset of g. For, by, our assumption 
on the cosets of H, an element y* of H* can be written in the form 
y* =gH where g Cg. Hence we have r(y*) = gpg-'Cg.— Moreover, any 
given neighborhood V* of e* contains at least one point x* such that 
t(x*)#p. For otherwise we have r(yH) =p for every y in a certain 
nucleus V of G, that is, for every yin V and hin H we have yhp(yh)-! 
=p or ypy~'=p. But then the one-parameter subgroup of G deter- 
mined by p would be invariant, contrary to the hypothesis that G 
is simple. 

Let W bea nucleus of G such that W-1WW CU. It follows from the 
last two paragraphs that there exists in H* a point z* near e* such that 
the linear segment e*z* is carried by 7 into an analytic arc contained 
in g(\W and consisting of more than a single point. A suitably chosen 
piece of this arc, when multiplied on the left by the inverse of one of 
its points, furnishes an analytic 1-cell K contained in g/\W and con- 
taining e, the identity of G. Starting with K we shall construct a di- 
mensionally increasing sequence of analytic continua, subsets of g. In 
what follows, let it be understood that all functions are real, single- 
valued and analytic over the domains indicated. 

We may suppose that K is defined parametrically, say by x;=f;(t) 
where —1<t<1 and f(0) =e. The set KK is in g and is defined by 
equations of the form x;=g;(s, t) where —1<s, t<1. Suppose that 
dim KK >dim K;; that is, suppose dim KK =2. Then being an ana- 
lytic locus, KK contains points at which it is locally euclidean 2-di- 
mensional. If 6 is such a point, then b-!'KK (a subset of g) is locally 
euclidean at e. Hence g/\W contains a 2-cell Ke defined say by 
xj=h(u, v) where —1<u,v<1 and A(0, 0) =e. We next consider the 
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set K2K» and suppose that its dimension exceeds that of K2. On con- 
tinuing in this manner, we finally obtain a k-cell E in g(\W defined 
say by xi=hi(u, -- - , ux) where —1<u;<1and A(0, - - - , 0) =e, and 
such that dim EE=dim E=k. We assert that E contains subsets E* 
and F such that (1) E* and Fare k-cells; (2) eC FCE*; (3) FFCE*. 

To prove this, we first note that by the theory of implicit functions, 
E contains a k-dimensional sub-cell E* definable, after renaming the 
coordinates x; if necessary, by equations 


(1) xi = Xx), i=k+1,---,y, 
where (x1, - -- , xx) ranges over the cube C;: —5<x;<6, and where 
X;(0,---, 0)=e;=0. On replacing 5 by a smaller number if neces- 
sary, it is easy to see that C; contains a cube C,: —p<x;<yp 
(t=1,---, h) such that if F is the k-cell defined by (1) with 
(x1, -- + , Xz) restricted to the cube C,, and if g is an arbitrary point 
of F, then qf, like F, is definable by equations of the form (1): 
where (x1, -- - , x4) ranges over a certain open subset A‘ of C;. Now 


EE is the union of k-cells gE (¢CE), hence is k-dimensional at every 
point. Being an analytic locus, the points g at which EE is locally 
euclidean k-dimensional fill it densely. Consider such a point g. The 
k-cells F and gF intersect at g. But since both are contained in EE 
which is locally euclidean k-dimensional at g, they coincide identically 
in the neighborhood of g. Hence the functions X; and X{ are identi- 
cally equal over an open subset of A?; hence, by the theory of analytic 
functions, they are equal over the whole of A*. Hence gFCE*, and 
this is true for a set of points g filling F densely. By continuity this 
relation holds for arbitrary gin F. Hence FFCE*%, proving our asser- 
tion. 

It is easy to see that on replacing F by a smaller k-cell if necessary, 
we have also F-!'CE*. In short F is a k-dimensional local Lie sub- 
group of G; hence it is an open subset of a k-dimensional linear sub- 
space L of the linear space of the canonical coordinates x, - - - , xy. 
If k<r, there exists in W an element a such that the linear subspace 
L’ determined by F’ =aFa~' is different from L; otherwise the Lie 
subalgebra represented by L is invariant. Since g is everywhere dense 
in G, we may assume, so far as the relation L¥L’ is concerned, that 
aCg. Then FF’ Cg. Moreover, it is evident that dim FF’>k. We can 
now repeat the construction described above starting with a suitably 
chosen analytic cell of dimension exceeding k in FF’. We obtain 
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finally an analytic r-cell contained in g/\W. Hence g contains a nu- 
cleus of G and hence g =G, a contradiction which proves the theorem.* 


CoL_umBIA UNIVERSITY 


3 We have proved, incidentally, that if an everywhere dense subgroup g of a simple 
Lie group G, (r>1) contains an analytic arc, then g=G. 


VECTOR SPACES OVER RINGS 
C. J. EVERETT! 


1. Introduction. Let Mt=u,K-+ --~- +u,,K be a vector space (lin- 
ear form modul [5, p. 111]) over a ring K = {0, a, B, - - - ; € unit ele- 
ment}. By a submodul N<M is meant an “admissible” submodul: 
NK Elements 1, ---, v, of a submodul form a basis for N 
(notation: N=1K+--- in case implies a;=0, 
t=1,---, m, and if every element of 9 is expressible in the form 
Dv, a;€K. The equivalent formulations of the ascending chain 
condition for submoduls of a vector space, and for right ideals of a 
ring will be used without further comment [5, §§80, 97]. 


2. Basis number, linear transformations. We remark that the fol- 
lowing holds. 


(A) The ascending chain condition is satisfied by the submoduls of a 
vector space I over K if and only if it is satisfied by the right ideals of K. 


An infinite chain of right ideals r1<1t2< ---~- in K yields an infinite 
chain of submoduls u:11<ut2< --- in Jt. The other implication is 
proved in [5, p. 87]. 

[By using a lemma due to N. Jacobson (Theory of Rings, in publica- 
tion) Theorem (A) and the corresponding theorem for descending 
chain condition are easily proved in a unified manner. | 

Linear transformations of Dt on M are given by uj;—uf =) ujai;. 
Write (uy,---, tm)A, A=(aij). Under uj, let 
Mo 0. Thus M/MoLMA = M. Clearly if and only if Av=0 
implies v=0, v an mX1 matrix over K, and JA =M if and only if 
there exists an mXm matrix R with AR=I, the identity matrix. 

Possibilities (i) and MA=M; (ii) Mo>O and MA <M; 
(iii) Mo=O0 and MA <M are familiar. The possibility of (iv) Dt» >0 


Presented to the Society, September 5, 1941; received by the editors May 27, 1941. 

1 The results presented here were obtained while the author was Sterling Research 
Fellow in mathematics, Yale University, 1940-1941. Thanks are due to Professors 
Oystein Ore, R. P. Dilworth, and the referee for helpful suggestions. 
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and IA =M is demonstrated later in (D), thus settling a question 
raised by van der Waerden [5, p. 115]. 

Case (iii) implies an infinite descending chain in MM, case (iv) an 
infinite ascending chain in M. 


(B) The set (1, --- ,n)=(us1,---, Um)A, n<m, forms a basis for 
M=umK+ --- +unK if and only if them Xm matrix (AO) has a right 
inverse: (A0)R=I, and Av=0 implies v=0, v an nX1 matrix over K. 


This is an immediate consequence of the basis definition. 


(C) If the right ideals of K satisfy the ascending chain condition, 
every basis of a vector space Mt=uK+ --- +uUmK has m elements. 


For a matrix (A0) of the type in (B) defines a linear transformation 
of type (iv) violating the chain condition in K. 

Hence with every vector space 2% over a ring K with ascending 
chain condition for right ideals is associated a unique basis number 
b(M). K a quasi-field is a trivial special case. 


(D) If K is the ring of all infinite matrices over a field, with only a 
finite number of nonzero elements in each row and each column, then the 
vector space M=mK+ --- +umK, m>1, has a basis of one element: 
M=uK. Thus there exist, for arbitrary m, 1 Xm matrices (an, -- - ,Qm), 
(B:,---, Bm) over K such that (o1,---, Bm) =I, the 
m Xm identity matrix, with aB=0,i=1,---,m,8€K implying B=0.? 


Let 5; be the vector (0, 0,---, 0, 1, 0,---)’ with 1 in the 
ith position from above. Matric elements of K are defined by 
their column vectors; let the unit of K be e=(6:, 6:,---) and 
a= (0, 0, 52, 0, = (61, 0, be, 0, 53, 0, bu, 


Let 
a2 0 0 0 
Then AB=I, and a:18 =a28 =0 implies B=0, BEK. Let 
I 0 
4) 
0A 


where J is the (m—2) X(m—2) identity matrix. It follows from (B) 
that m1, ---, Um—2, v form a basis for Mt, where (uw, - - - , Um—2, 0) 


2 A’ means A transpose. 
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=(u1, --- , Um)A1. The induction is obvious, and 2 has a basis of a 
single element. The theorem follows from (B). 


3. Vector spaces over right principal ideal rings. We now remark 
that the following holds: 


(E) If M=mK+ --- +u,K is a vector space over a ring K in which 
every right ideal r>0 is of type poK, where ppx=0, aC K implies a=0, 
then every submodul N, O<N=M, has a basis of n elements, n=m. 


This is only a trivial modification of the van der Waerden result 
[5, pp. 88, 121], appropriate since the condition subsequently also ap- 
pears to be necessary (see (F)). 


Lemna 1. If every submodul N, O0<NLM=umK+ --- +unK has 
a basis of n<m elements, and t is a right ideal of K,0<t<K, then the 
submodul N=urU - - - Uunr, consisting of all sums pi€t, has 
basis un, Um with i=1,---, m, and uy ts a basis 
for ux. 


For 0<ut=unK+--- ---Uuar 
is a submodul for which the u,; together form a basis of }; elements. 
The hypothesis of the lemma implies the ascending chain condition 
in I, and hence in K (by (A)). Hence by (C) the basis number for 
MN is unique and n;=m, n;=1,1=1,---,m. Thus ut=uy-K. 


(F) Let M=uK+ --- +u,K be a vector space over K. Then every 
submodul N, O<NSM, has a basis of n=m elements, if and only if 
every right ideal t>0 in K is of type poK, where ppxa=0, aC K, implies 


a=0. 


For if r>0 is a right ideal of K, by the lemma, mt =uyK, u11.=U1po, 
poCr. Then and r=poK. Moreover pow =0 implies una =0 
and a=0. 

Now suppose I{=u,K+ - -- +u,K is a vector space over a ring 
K of the type in (F). To every submodul 0, 0<N < M, corresponds a 
unique basis number Define b(0) =0. 


(G) If M=umK+ --- +u,K is a vector space over a ring K of the 
type in (F), the basis number b(N), OSF NLM, is a positive modular 
functional [1, p. 40]: 

M1. (AUB) +b(ANB) =b(H) +4(B), 

M2. implies b(A) <d(B). 


M2? is clear from (F). A proof of M1 may be made by induction on 
b(M). We treat here only the following case: 
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Let K be a (noncommutative) domain of integrity in which every 
right ideal is principal.* The vector space M=u:K+ --- +unK may 
then be regarded as imbedded in the vector space @*=u:K+ - - - 
+umK where K is the quotient quasi-field of K. The existence of K 
follows from theorems developed by Ore [3, p. 466] and a proof by 
Teichmiiller [4] that the least common multiple of nonzero elements 
in such a K is not zero. The correspondence 


(y) 


is a well-defined correspondence on the lattice LZ of all K-submoduls 
of M to the entire lattice Z of K-submoduls of M*, (since N* is inde- 
pendent of the Jt-basis). Observe that b(2) =b(MN*) as a submodul 
of I*. For the K-independence of a basis (, - - - , v,) of N implies the 
K-independence of 1, - - - , ¥,: Let =0, a;=a;/B;EK (Ore quo- 
tient) ; if 4 is the (nonzero) least common multiple of the§;, Doe =0, 
and ay:CK by the Ore theory referred to. Hence ay:=0, and a;=0, 
4=1,---,n. 

It is trivial to verify that: 

(1) implies A* >B*. 

(2) (WUS)* =A*UB*. 

(3) (ANB)* =A*NB*. 

For example, in (2) (A4\.U%)* = A*\UB* follows from (1). But every 
element in (%.U)* is a K-form in a K-basis of AU, hence is in 
A*\UB*. Since b(A*) is the dimension of A* over K, it follows that 
b(M) is a positive modular functional on L. 

We may now apply the theory of such functionals [1, p. 42, Theo- 
rem 3.10] to show that 6(%, B) =b(A%UB) —b(ANB) is a quasi-metric 
on L: 

(4) 6(A,B)=0, A) =0. 

(5) B)+6(%, © = 6(A, ©). 

The relation A~ defined by 5(A, B) =0 is an equivalence relation, 
and the correspondence A—[A], the equivalence class containing Y, 
is a lattice homomorphism of L onto the metric lattice L’ of equiva- 
lence classes. For want of a name, we call L’ the metric homomorph 
of L. However, in the correspondence (y), A* =B* if and only if A~B. 
For, if A~B, (AUB) =b(ANB), and A*UB*=A* =B* = A*MNB*, 
since all these have the same dimension over K. Conversely, if 


A* =B*, then (WUB)* =A* = (ANB)*, b(AUB) =b(ANB) and A~B. 
(H) If K is a right principal ideal domain of integrity, quotient field 


3 For the elementary divisor theory of matrices over such domains, and references 
to the literature, see [2]. 
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K, then the basis number b(t) is a positive modular functional on the 
lattice L of submoduls of M=u:K+ --- +tmK, and the metric homo- 
morph L' of L is lattice isomorphic with the lattice of submoduls of 
M*=uK+ --- +unK. 


4. Vector spaces over quasi-fields. We now typify vector spaces 
over quasi-fields by (I) and (J). 


Remark. A ring K={0,a, -- - } with unit €, whose only right ideal 
t>0Ois K, ts a quasi-field. 


Let a¥0. Then 0<aK=K, aB=e. The right annihilator (right) 
ideal r of @ is (0), for r>0 implies r=>K, and ae=a=0. Hence 
a(Ba—€) =aBa—a=a—a=0 and Ba=e. 


(1) Let M=uK+ --- +u,K bea vector space. Then every submodul 
N, O<N=IM, has a basis of n<m elements, with NR <M implying n<m, 
af and only if K is a quasi-field ; that is, the modular functional b(N) ona 
vector space over a ring K of the type in (F) is sharply positive [1, p. 41] 
if and only if K is a quasi-field. 


These are well known properties of a vector space over a quasi- 
field. If they hold, then by Lemma 1, the existence of a right ideal r, 
0<r<K implies N=mrVU -- - Uuar<M with b(M) =b(M), contrary 
to hypothesis. Hence (I) follows from the remark above. 


(J) Let M be a vector space over a ring K of the type in (F). Then M 
satisfies the descending chain condition if and only if K is a quast-field. 


For rings of this type, the descending chain condition in It and 
sharp positiveness of are equivalent. If &<B with b(A) =(B), 
the transformation of 8-basis into %-basis is of type (iii), on B. 
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PSEUDO-CONFORMAL GEOMETRY: FUNCTIONS OF 
TWO COMPLEX VARIABLES 


EDWARD KASNER AND JOHN DECICCO 


1. Introduction. The theory of functions of a single complex vari- 
able is essentially identical with the conformal geometry of the real 
(or complex) plane. However, this is not the case in the theory of func- 
tions of two independent complex variables. Any pair of functions of 
two complex variables induces a correspondence between the points 
of a real (or complex) four-dimensional space S,. The infinite group G 
of all such correspondences is obviously not the conformal group of S,. 
Poincaré in his fundamental paper in Palermo Rendiconti (1907) has 
called G the group of regular transformations. In an abstract pre- 
sented to the Society, 1908, Kasner found it more appropriate to term 
it the pseudo-conformal group G. 

In a preceding paper, Kasner has given a purely geometric charac- 
terization. His main result is that the pseudo-conformal group G is 
characterized by the fact that it leaves invariant the pseudo-angle between 
any curve and any hypersurface at their point of intersection.* 

In the present work, we shall find a// the differential invariants of 
first order between the curves, surfaces, and hypersurfaces at a given 
point under the pseudo-conformal group. We shall take every com- 
bination of any two elements—six possible cases.? The number of in- 
dependent invariants may be 0, 1, or 2. 

A general pair of curve elements possesses no invariants. However, 
in the special case of an isoclinal pair, there is a unique invariant (the 
angle between them). A similar result is true for two hypersurface ele- 
ments. 

A hypersurface element and a curve element possess only one in- 
variant—the pseudo-angle.' 

To any general surface element S, there is associated a quadric 
regulus R of curve elements. There are no invariants between a gen- 
eral surface element S and a curve element e which is not on the 
regulus R of S. On the other hand, if e is in R, then there is a unique 


Presented to the Society, September 5, 1941; received by the editors July 19, 1941. 

1 Kasner, Conformality in connection with functions of two complex variables, Trans- 
actions of this Society, vol. 48 (1940), pp. 50-62. See also the following paper: Kasner, 
Biharmonic functions and certain generalizations, American Journal of Mathematics, 
vol. 58 (1936), pp. 377-390. 

? We sha!l denote by e a curve element, that is, a lineal element; by S a general 
surface element; and by = a hypersurface element. The six possible cases are (e, é), 
(e, x), (x, x), (e, S), (x, S), (S, S). 
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invariant between e and S. We find similar results by substituting 
hypersurface element 7 for curve element e in the preceding state- 
ments. 

Finally, two general surface elements possess two independent in- 
variants. 


2. The minimal coordinates. Let (e€, n, ¢, £) denote the cartesian 
coordinates of any point of complex euclidean four-space S,. In our 
work, we shall find it very convenient to introduce the minimal co- 
ordinates (x, y, u, v) defined by 


(1) x=e+m u=e-m, 
The inverse of this correspondence is 
2 2% 2 2% 

In minimal coordinates, the distance ds between any two nearby 
points of space is 
(3) ds? = dxdu + dydv. 
If @ is the angle between any two directions of space, then 


+ dxedu; + dyidve + dy2dr 


(4) cos = 
2[(dxidus + + dyedve) 


3. The pseudo-conformal group This is given in minimal coordi- 
nates by 


(5) X = X(z, 9), Y = F(z, 9), U = U(u, 2), V = V(u, 2), 


where X,Y,—X,Y.40 and U,V,—U,V.,#0. Our problem is to in- 
augurate the study of the geometry of this group. 

In what follows, we shall omit from consideration the special mini- 
mal planes x =const., y=const., and “=const., v=const. Obviously 
our pseudo-conformal group may be defined as that preserving the 
2 ©? special minimal planes. 


4. The pseudo-conformal geometry of differential elements of first 
order. In this and the following sections, we shall mainly be interested 
in the geometry of the curve elements at a fixed point of S,. Obviously 
these form a three-dimensional manifold 23. 


3 We shall study only the continuous pseudo-conformal group, not the more ex- 
tensive mixed group obtained by adjoining functions of the conjugate complex vari- 
ables. 
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The pseudo-conformal group induces the seven-parameter group 
G; among the ~? curve elements of 2; at a given point, defined as 
follows 


6) pX’ = ax’ + by’, pU’ = au’ + Br’, 


pY’ = cx’ + dy’, pV’ = yu’ + ov’, 


where ad —bc 0 and ad —By £0. Note that (px’, py’, pu’, pv’) are pro- 
portional to the four differentials (dx, dy, du, dv). 


5. The isoclinal surface elements. A surface element of 2; is said 
to be an isocline if it is given by two equations of the form 


(7) = Au /ul 


Otherwise a surface element mot an isocline is called a general surface 
element. There are ? isoclines among the © ‘ surface elements of 23. 
There is a unique isocline through any given curve element. There- 
fore two isoclines never possess a common curve element. 
By applying (6) to (7), we find that amy tsocline becomes an isocline 
under G7. The group of correspondences between the isoclines is the 
six-parameter set 


uXy = axi + by, vUi = + Boi, 
uY{ + dyi, wWi = yur + 


6. The invariants of two curve elements of 2;. In the first place, 
it is observed that the most general transformation of G7; which will 
carry the curve element (1, 0, 1, 0) into the curve element (1X7, 
pi Yi, p1U1’, pi Vi ) is of the form 


px’ = piXix’ + by’, pU’ = p,Uiu' + Br’, 
pY’ = x’ + dy’, pV’ = pViw + dv’. 


Under G;, any curve element of 2; may be carried into any other 
curve element of 23. The canonical form of any curve element is 
(1, 0, 1, 0). There are « * such transformations. 

A pair of curve elements is said to be an isoclinal pair if both ele- 
ments lie in the same isocline. Otherwise it is said to be a general pair. 
The conditions necessary and sufficient for an isoclinal pair are (7). 

Any curve element which lies in the same isocline with (1, 0, 1, 0) 
must be of the form (x/, 0, y/, 0). The transform of this under the 
correspondence (9) is 


(8) 


(9) 


(10) = piXi xz, = piUi uz, 
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These equations immediately guarantee that the transformed element 
(Xi, YZ, Uz, Vi) is in the same isocline with (X/, Yi, Ui, Vi). 
Moreover the expression 


(11) Xi/Xi-Ui/Ui = Yi/Yi-Vi/Vi 


is invariant. By taking the logarithm of this invariant and then multi- 
plying the result by 1/24, it is found by (4) that the resulting invariant 
represents the angle between the isoclinal pair of curve elements. 

Next let us observe that the group which preserves the curve ele- 
ment (1, 0, 1, 0) is 


(12) pX’ = pix’ + by’, pY’ = dy’, pU’ = pw’ + Br’, pV’ = iv’. 


This group demonstrates that the curve element (x’, 0, u’, 0) is 
preserved. Hence we find the following proposition to be true. 


THEOREM 1. An isoclinal pair of curve elements possesses the unique 
invariant 


13) = — lo 
(13) 2i xi up 2% yi 


This actually is the angle 0 between the two directions. 


The canonical forms of any isoclinal pair of curve elements are 
(1, 0, 1, 0) and (x’, 0, u’, 0). There are ©‘ such transformations. 

Next any transformation of the group (12) which carries the curve 
element (0, 1, 0, 1) into the curve element (92X72, p2¥2, p2U2, p2V2 ) 


is of the form 
(14) pX’ = pix’ + y’, pU’ = pw’ + 
pY’ = y’, pV’ = 


This demonstrates immediately that any general pair may be sent 
into any other general pair of curve elements. 

The group which preserves the general pair of curve elements 
(1, 0, 1, 0) and (0, 1, 0, 1) is 


(15) pX’ = pix’, pY’ = pry’, pU' = pw’, pV’ = px’. 


THEOREM 2. Any general pair of curve elements may be converted into 
any other general pair. 


7. The nonexistence of invariants for an isocline paired with an 
isocline or a curve element. By (6), it may be proved that any isocline 
may be carried into the isocline x’=0, u’=0. The subgroup of (6) 


| 
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which preserves this isocline x’ =0, u’ =0 is, in curve element coordi- 
nates, 


(16) pX’=ax’, pY’=cx'+dy, pU' pV’ = + ov’. 


Any isocline may be carried into any other isocline. The canonical 
form of an isocline is x’ =0, u’ =0. There are ©* such transformations 
of G; which carry a given isocline into a prescribed isocline. 

By (16) it may be proved that any isocline distinct from the iso- 
cline x’=0, u’=0 may be carried into the isocline y’ =0, v’=0. The 
group which preserves the two isoclines x’ =0, u’ =0, and y’ =0, v’ =0 
is 


(17) pX’ = az’, pY’ = dy’, pU' = an’, pV’ = dv’. 


THEOREM 3. Any two distinct isoclines may be carried into any other 
two distinct isoclines. 


The canonical forms of two distinct isoclines are x’ =0, u’ =0; and 
y’ =0, v’=0. There are ~ * such transformations of Gy. 

By combining (9) and (16), it is found that any curve element not 
on the isocline x’=0, u’=0 may be carried into the curve element 
(1, 0, 1, 0). The group preserving the isocline x’=0, u’=0 and the 
curve element (1, 0, 1, 0) is 


(18) pX’=pix’, p¥’=dy’, pU’=pw', pV’ = by’. 


THEOREM 4. An isocline y and a curve element e not in y can be con- 
verted into any other isocline T and a curve element E not inT. 


The canonical forms of an isocline y and a curve element e not in y 
are y:u’=0, x’=0; and e(1, 0, 1, 0). There are ©? such transforma- 
tions of G7. 

It may be proved that any isocline y and a curve element e in y 
can be carried into the isocline x’=0, u’=0 and the curve element 
(0, 1, 0, 1). The group which preserves the isocline x’ =0, u’ =0 and 
the curve element (0, 1, 0, 1) is given by (16). 

An isocline y and a curve element e in y can be carried into any 
other isocline T and a curve element E in I. 

The canonical forms of an isocline y and a curve element ¢ in y are 
y:u'=0, x’ =0; and e(0, 1, 0, 1). There are © 4 such transformations. 


8. The invariants for a hypersurface element paired with a curve 
element, or hypersurface element, or isocline. Any hypersurface ele- 
ment of 2; is 


(19) kx! + ly’ + mu’ + nv’ = 0. 


Any hypersurface element may be given by 1(ck, ol, om, on). 
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Note that & and I, or m and n, cannot both be zero as hypersurface 
elements which contain the two minimal planar directions x’ =’ =0, 
or u’ =v’ =0 are excluded from consideration. 

The induced group between the hypersurface elements of 2; is 


ok = aK + cL, om = aM + yN, 


(20) 
ol=bK+dL, on= 6M +N. 


Let a hypersurface element r(k, 1, m, m) and a curve element 
e(x’, y’, u’, v’) be given, (e not in 7). Upon drawing the isocline y 
through e, we find that y intersects 7 in the curve element 


X'/x’ = Y'/y' = — w(mu' + nv’), 


(21) 
U'/ul = = w(kx’ + ly’). 


By Theorem 1, we know that the angle between e and the above curve 
element is invariant. 

By (12) and (20), we see that the group which preserves the curve 
element (1, 0, 1, 0) may be written in hypersurface element coordi- 
nates as 


(22) ockh=piK, ol =bK+dL, om=p,M, on = BM + 56N. 


By these equations, it may easily be proved that the hypersurface 
element (ck, ol, om, on) can be sent into the hypersurface element 
II(K,0, M, 0). The group which preserves the curve element e(1, 0, 1, 0) 
and the hypersurface element 7(k, 0, m, 0) is 


(23) ok = pik, ol = dL, om = piM, on = ON. 


THEOREM 5. A hypersurface element x and a curve element e (e not 
in ©) possess the unique invariant 


1 mu’ + nv’ 
(24) = | - 
2i kx’ + ly’ 


This is the angle between e and the unique curve element e* in x such that 
e and e* form an isoclinal pair. This is identical with Kasner’s pseudo- 
angle. 


The canonical forms of a hypersurface element 7 and a curve ele- 
ment e (e not in 7) are m: kx’+mu’=0 and e(1, 0, 1, 0). The pseudo- 
angle is then (1/27) log (—m/k). There are ~? such transformations. 

Any hypersurface element contains a unique isocline. Thus the iso- 
cline in the hypersurface element 2(k, 1, m, n) is 


(25) kx’ + ly’ =0, mu’ + nv’ = 0. 
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A hypersurface element 7 and a curve element e contained in x but 
not in the isocline of 7 can be carried into any other hypersurface ele- 
ment II and a curve element E contained in II but not in the isocline 
of IT. 

The canonical forms of a hypersurface element 7 and a curve ele- 
ment e contained in 7 but not in the isocline of 7 are 7: x’ =u’ and 
e(1, 0, 1, 0). There are «©? such transformations. 

A hypersurface element 7 and a curve element e contained in the 
isocline of 7 can be sent into any other hypersurface element II and 
a curve element E contained in the isocline of II. 

The canonical forms of a hypersurface element 7 and a curve ele- 
ment e contained in the isocline of x are 7: y’+v’ =0 and e(1, 0, 1, 0). 
There are « * such transformations. 

A pair of hypersurface elements are said to be an isoclinal pair if 
they intersect in an isocline. Otherwise they are said to be a general 
pair. 

In hypersurface element coordinates, the equations of an isocline 
are 


(26) k/ky = m/m, = n/m. 


Of course, these are the conditions that the hypersurface elements 
(Ri, 11, m1, m1) and (k, 1, m, n) form an isoclinal pair. 


THEOREM 6. An isoclinal pair of hypersurface elements 7 and 72 
possess the fundamental invariant 


1 
27 6 = — lo —-— = — lo _-s 
2% 21 le nN 


This is Kasner’s pseudo-angle between m2 and any curve element of 7. 


The canonical forms of an isoclinal pair of hypersurface ele- 
ments are x’+u’=0 and kx’+mu’=0. The invariant (27) is then 
(1/22) log (m/k). There are « 4 such transformations. 


THEOREM 7. In the other possible pairs of elements, no further in- 
variants are found. 


On examination, we find the following standard forms: 

The canonical forms of a general pair of hypersurface elements are 
x’+u’=0 and y’+v’=0. There are 1 such transformations. 

The canonical forms of any isocline y and a hypersurface element 
™, not in 7, are y:y’ =0, v’ =0, and m:x’+u’=0. There are ©? such 
transformations. 

The canonical forms of an isocline y and a hypersurface element 
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w, y in m, are y:y’=0, v’=0, and r:y’+v’=0. There are 4 such 
transformations. 


9. The invariants for a general surface element paired with a curve 
element, or a hypersurface element, or a general surface element, 
or isocline. A surface element, not an isocline, may be given by w’ 
and v’ expressed as linear functions of x’ and y’; or by x’ and y’ ex- 
pressed as linear functions of u’ and v’. Because of the symmetry of 
our group (6), we find that it always can be supposed that a general 
surface element can be written in the form 


(28) = px’ + qy’, + sy’. 


Since we wish to omit from consideration those general surface ele- 
ments, which contain a curve element of the minimal planar direc- 
tions x’=y’=0, or u’=v'=0, we find that ps—gr+0. 

The correspondence between the general surface elements is 


(ad — By)p = adP + cdQ — aBR — cBS, 
(ad — By)q = b5P + diQ — — ABS, 
(ad — By)r = — ayP — cy0 + aaR + aS, 
(ai — By)s = — byP — dyQ + baR + daS. 
Let S(p, g, 7, s) be a given general surface element. Through each 
curve element of S, there is a unique isocline. There are, therefore, 


« ! isoclines which intersect a given surface element S. These isoclines 
generate a nondegenerate quadric regulus R, which is given by 


(29) 


(30) 


Let e(x’, y’, u’, v’) be a curve element of the regulus R but not in S. 
The isocline through e will intersect the general surface element S in 
the curve element e* given by 


(31) = Y'/y' = ou’, = V'/0' = + gy’). 


By Theorem 1, the angle between e and e* is invariant. 

Since any general pair of curve elements may be carried into any 
other general pair, it follows that any general surface element may be 
sent into any other general surface element. In particular, we may 
carry any general surface element into the general surface element 
u'=x',v'=y’. 

Let us find the subgroup of G; which leaves the general surface ele- 
ment u’=x’, v’=y’ invariant. By (29), we must have 


us px’ + gy’ 
| | 
v rx + sy 
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(32) ab — By = a — B= —by+da, 0= — dB = — ay+ ca. 


By these equations, it follows immediately that a=a, B=b, y=c, 
5=d. Thus the subset of G1, which leaves our general surface element 
u’'=x',v' =y’ fixed ts the three-parameter group in curve element coordi- 
nates 


(33) pX’=ax'+by’, pV’=cu'+dr’. 


Now there exists a transformation of G; which carries our general 
surface element into S:u’=x’, v'=y’. Therefore our curve element is 
carried into a curve element which belongs to the regulus of S. Hence 
the coordinates of this new curve element satisfy the condition 
u'/v' =x'/y'. Now in (33) take —c/d=y’/x’ =v'/u’. Obviously this 
transformation will take our curve element into the curve element 
(x’, 0, u’, 0). 


THEOREM 8. Let the curve element e(x’, y’, u’, v') be on the regulus R 
of the general surface element S:u’ = px'+qy’, v' =rx'+sy’. That is, let 
(u’/v’) =(px'+qy’)/(rx’+sy’). Then e and S possess the unique in- 
variant 


(34) 9 


1 


= — log ————_ = — log —_—__ 
2% px’ +qy rx’ + sy’ 


This ts the angle between e and the curve element e* on S such that e and e* 
are an tsoclinal pair. 


The canonical forms of a curve element e and a general surface ele- 
ment S, e on the regulus R of S but not in S, are e(x’, 0, u’, 0) and 
S:u' =x’, v’=y’. There are ©? such transformations. The invariant 
(34) is then 6 = (1/22) log (u’/x’). 

Any pair consisting of a general surface element S and a curve ele- 
ment e contained in S can be converted into any other such pair. 

The canonical forms of a general surface element S and a curve ele- 
ment e contained in S are S:u’ =x’, v’=y’ and e(1, 0, 1, 0). 

Next let us note that the transformation (33) where 

a=" U1), Sey + UY), 
21 2 
(35) 
2i 2 
will carry the curve element (7, 1, —7, 1), not in the regulus of the 


general surface element S:u’=x’, v’=y’, into any curve element 
piYi, piUs, pi Vi), not in the regulus of S. 


1 po 
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We also note that the only transformation which preserves the gen- 
eral surface element u’ =x’, v’ =’ and the curve element (i, 1, —7, 1) 
is the identity. 


THEOREM 9. There exist no invariants between any curve element e 
and a general surface element S such that eis not on the regulus R of S. 
There exists one and only one transformation of G; which will carry any 
such general pair into another general pair. 


Let (x’, y’, u’, v’) be the coordinates of the curve element of con- 
tact between the tangent hypersurface 7(k, 1, m, n) and the regulus R 
of the general surface element S. Then 


(36) pk = ru’ — pv’, pl = su’ — qu’, pm = rx’ + sy’, pn = — px’ — qy’. 
Solving these equations for (x’, y’, u’, v’), we find 
(37) ox’=—mg—ns, ou'=pl—kq, cy'=pm+rmn, ov =—ks+rl. 


The angle between this curve element e and the curve element e* in S 
such that e and e* are an isoclinal pair is invariant. The preceding 
equations in conjunction with Theorem 8 yield the following informa- 
tion. 


THEOREM 10. Let a hypersurface element x(k, 1, m, n) be tangent 
to the regulus R of the general surface element S:u'=px'+qy’, 
v'=rx'+sy’. The condition necessary and sufficient for this is (—n/m) 
=(kqg—pl)/(ks—rl). Then x and S possess the unique invariant 


1 gk — pl 1 sk — rl 
(38) = — log | —————_ | = — log| — ————_— I. 
2i n(ps — qr) 2i m(ps — qr) 
This is the angle between the curve element e of contact between the hyper- 


surface element x and the regulus R, and the curve element e* on the gen- 
eral surface element S such that e and e* form an isoclinal pair. 


Again by Theorem 9 and equations (37), we deduce the following 
theorem: 


THEOREM 11. There exist no invariants between a hypersurface ele- 
ment © and a general surface element S with x not tangent to the regulus 
R of S. There exists one and only one transformation of Gz which will 
carry any such general pair into another general pair. 


The canonical forms are S:u’=x’, v’=y’, and m:ix’+y’—iu'+0' 
=0. 

Let 8 be the angle between any curve element e of the regulus R; of 
the general surface element S,; and the curve element e* on S; such 
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that e and e* are an isoclinal pair. Upon setting ¢=e?”, we find by 
Theorem 11 that the regulus R; may be given parametrically by 


(39) t+qny), = + sry). 


Now the general surface element S2 will intersect R; in two curve 
elements e; and ¢;, given by 


ul = o(pix’ + gry’) = pox’ + gay’, 
= (rix’ + sry’) = rex’ + Sey’. 


The elimination of x’ and y’ gives a quadratic equation in ¢. The 
roots of this quadratic equation are therefore invariants of the two 
general surface elements S; and Sz. 

Now we shall show that these two invariants are the only ones. In 
the first place, we may assume that S; is of the form u’=x’, v’=y’. 
Then from (40), we find that our invariants are the roots of 


(41) — + + (poss — = 0. 


Since the sum and product of these roots are invariant, we know that 
(p2-+52) and (f252—72g2) are both invariants. 

Next we shall show that we can carry our surface S2(p2, ge, 72, S2) 
into (P2, 0, 0, S2). In the first place, the two invariants of the preced- 
ing paragraph yield the following two conditions 


(42) P2+S2 = + 52, P2S2 = poS2 — 1292. 


The elimination of s2 yields the condition 


(40) 


(43) bs — (P2 + S2)p2 + + roq2 = 0. 


Let us in the first place assume that our general surface element Sz 
is contained in the regulus R; of S:. Then S; must be of the form 
(p2, 0, 0, p2) which is the desired form. As a matter of fact, it is seen 
that our group (33) preserves any general surface element S: con- 
tained in Rj. 

Now let us suppose that our general surface element S2 is mot con- 
tained in the regulus R;. Then its transformed surface element 
(Pe, 0, 0, Se) is not contained in R; so that P2¥ So. 

The group (33) shows that if (p2, ge, 72, S2) is carried into 


(P2, 0, 0, Se), then 
(44) (ad — bc)p2 = adPz — bcS2, (ad — bc)re = — ac(P2 — S2), 
(ad = bc)qe = bd(P2 So), (ad bc) se bcP2 + adS>. 


By use of the first of equations (42), it may be shown that the last of 
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the preceding equations is equivalent to the first equation. Hence our 
conditions reduce to 


ad(p2 — P2) = bc(p2 — 
(45) (ad — bc)g2 = bd(P2 — S2), 
(ad — bc)re = — ac(P2 — S2). 
Now in the first of these equations, we see that p2—P2 and p2—S2 


cannot both be zero, for otherwise P2=S2, which is impossible. Hence 
there exists a nonzero number J so that 


(46) ad = Mp2 — S2), ac = — Xr, bc = — bd = Ago. 


From these equations, we deduce that since \ <0, then 

p2 — Pe 

a p2 — S2 q2 

Of course the last equation is true because of (43). Substituting (47) 


into (46), we see that a:b:c:d are determined except for an arbitrary 
constant. 


(47) 


THEOREM 12. Two general surface elements S; and Sz possess the two 
differential invariants o; and $2 which are roots of the equation 


(48) (p1s1 — + (gure + geri — pis2 — + (pose — gore) = 0. 


If we let 0 = (1/22) log ¢, then 0 is actually the angle between the curve ele- 
ment e;, or €;, 1m which Sz intersects the regulus Ri of S: and the curve 
element e* in S, such that e;, or e;, and e* are an isoclinal pair. There 
are no more invariants. 


Finally, we may prove the following result. 


THEOREM 12*. There are no invariants between a general surface ele- 
ment S and an isocline y. Any such pair may be carried into any other. 


In conclusion, we observe that Kasner’s pseudo-angle may quite 
easily be extended to characterize the group of pseudo-conformal 
transformations defined by ” complex functions of m complex vari- 
ables. This group is characterized within the group of arbitrary point 
transformations of a 2m-dimensional space by the preservation of the 
pseudo-angle between a curve and a (2m—1)-dimensional hypersur- 
face at their common point of intersection. 
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ON INTEGRAL FUNCTIONS OF INTEGRAL 
OR ZERO ORDER 


S. M. SHAH 


Let F(z) be an integral function of finite order p. We write 
F(z) =2*e" f(z) where g(z) is a polynomial of degree g <p and 


Doo 


is the canonical product of order p; and genus p. Let M(r, F) 
=mMaxXjz\j=r |F (z)| and n(r, F—a)=n/(r, a) be the number of zeros of 
F(z) —a in |z| =r. In an earlier paper! I proved the following result. 


THEOREM 1. Jf F(z) be of integral order p and if the genus of the 
canonical product f(z) be p=p, then 


, log M(r, F) 
inf = 
r=0 F)9(r) 


where (x) is any positive continuous increasing function of the real 
variable x such that 


0... 
@) x(x) 


is convergent. 


(1) 


In this note I prove a similar result for the canonical products of 
order p and genus p=p—1, and discuss whether the result can be ex- 
tended to integral functions which are not canonical products. The 
main result is the following. 


THEOREM 2. If f(z) is a canonical product of integral order p and 
genus p=p—1 then 


(3) lim inf log M(r, f) = 
n(r, f) P(r) 


where ®(x) is any positive increasing function such that 


a x(x) 
Received by the editors July 20, 1941. 


1 A Theorem on integral functions of integral order, Journal of the London Mathe- 
matical Society, vol. 15 (1940), pp. 23-31. I shall refer to this paper as (1). 
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is convergent and 

(4.1) &(x)/x* 

ts monotonic for all large x, sayx 2A >0;aa constant such that0<a<1. 
1. For all r217(A, B) 


dx Ar'-6 
J= f 4 
4, logr 
where A and A, are positive constants, and B is a constant such that 
0<B<1. 


Proor. From the convergence of the integral in (4), we have 
log x <®(x) for all x Hence for 


ri/2 r 1 2 
na (Ax) (1— logr  logr 
LEMMA 2. Suppose that the real functions (x) and 0(x) satisfy the 
following conditions: 
(1) W(x) is continuous in (6, ©) where 5>0, except for isolated points 
where W(x) has ordinary left-hand discontinuities. 
(2) W(x) is non-increasing as x= 5 increases in any interval between 
two consecutive discontinuities. 
(3) @(x) is a positive continuous increasing function for x= 6. 


(4) lim supy(x) = ~, lim sup = = 0 
x 


Then we can find a sequence {x,} of values of x tending to © such 
that the two inequalities 


O(x) 


are satisfied simultaneously. 
The x, are points of discontinuity so that p(xn) =(xn+0) and x; is 
the first point of discontinuity in (6, ~). 


The proof is similar to that of Lemma 2 of my paper referred to 


above, and is based on the following lemma of Pélya.? 


If 
ly, Ie, ls, ln > 0, 


S1, S2, 53, °° y $1 > 03 > Sm3m = 1,2, 3,---, 


2 Mathematische Annalen, vol. 88 (1923), p. 170. 
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are two sequences of positive numbers, of which the second is monotonic 
and increasing, such that 


lim /,, = 0, lim sup lnsm = ©, 


then we can find an infinite sequence {n} of the indices n such that the 
two sets of inequalities 


vy>n, 
> 
are satisfied simultaneously. 


To prove Theorem 2 we first consider the case when 


(5) Him sup 0, 

We have 

(5.1) < An(r) ®(r) 


for an infinity of values r=R, tending to © and so 
log M(R,) A log M(R,) 
< 


0 asn— o, 
n(Rn) 


Hence 
log M(r) 
inf ———— = 
r=0 
Suppose secondly that 


_ 
T=@ 


(5.2) 0. 


Here ®(x)/x* must be monotonic decreasing, for if not ®(x) 2Ax*, 
and so 


lim su 


= 0, 


® 
n(r) ®(r) n(r) 


contradicting hypothesis (5.2); so ®(x)/x* is monotonic decreasing 
for x 2A. We apply Lemma 2 putting 


n(x) ®(x) 1 


¥(x) = 


and choosing 6(x) =x*, 8 a constant such that 0<8<1—a, 5=A. The 
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conditions of Lemma 2 are satisfied, and hence, putting x, = R we ob- 
tain 
n(x) B(x) n(R) 
Rpti-8 
n(x) ®(2) _ n(R)®(R) 
gett 


» 


for x > R. 


Thus for R>x, 


log M(R, f) < AI(R, f) 
n(x, f) Rr 
(x+ R) 


R xPt2 

aha m(R)®(R) dx + n(R)(R) dx \ 

pen 


{ A(R) ®(R) 
log R 


lA 


+ o(n(R) . 


Hence 
log M I 
n(r, f) P(r) f) 

and this completes the proof of the theorem. 

CoroLiary. If F(z) f(z) is of integral order p and genus 
g=p—1 then 

log M(r, F 
(7) 
T=2 n(r, f) P(r) 

We have g=p—1=max (p, g). It is easily seen that p=p—1, 
gsp—iand 
(8) log M(r, F) < A{re- + log r} + log M(r, f). 
If 

lim su 


n(r, f) P(r) 


then R, being defined in (5.1), 
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log M(Ra,F) +logR,} log M(Rz, f) 
n(R,, F)®(Rn) n(R,, F)®(Ra) n(Rp, f) ®(Rn) 


Hence 


log M(r, F) 
inf ———————_- = 
n(r, F) 
If now 


n(r, f) ®(r) 


0 


then for all large r 
log M(r, F) < A{re-! + log r} + Al(r, f) < Asl(r, f) 


and hence from (6) the required result follows. 

The condition (4) on ®(x) is sufficient but not necessary’ for (3) and 
(7) to hold. The condition (4.1) is also not necessary for we can take 
(x) to be any function 


(11x) (lex) -- + (ix) (Lx), n> 0, 
of the logarithmic comparison scale, and hence any function for which 
#(x) 
lim inf 2A>0. 


(11x) (lex) cee (Lyx) 


We can take ®(x) to be any positive L function‘ which satisfies (4) 
but we cannot take ®(x) (or ¢(x) in Theorem 1) tobe (J1x) (lex) - - - (ix). 
Consider for instance 


where 


a, = — n(lyn) -- - 
an = n(lyn) -- 


The functions f;(z) and f2(z) are canonical products of order 1. The 
genus of f(z) is 0, and of f2(z) is 1. For each of them we have 


log M(r) 


3 Cf. p. 4 of (1). 
‘ For definition see G. H. Hardy, Orders of Infinity, 1924, p. 17. 
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In what follows we shall take $(x) to be a positive LZ function satis- 
fying the condition (2). 

Suppose now F(z) is of integral order p. There are four possibilities: 

(1) (2) 

(3) m=q=p,p=p-—1, (4) p=p,qg<p, p=p—1. 

Combining the above results we have in cases (2) and (4) 

log M(r, F) 

inf —————_ = 
nr, F)$(r) 

In cases (1) and (3), (9) does not hold. For functions of fractional 
order and zero order® it certainly holds. In particular (9) is true for 
any canonical product of finite order; it also holds for functions of maxi- 


mum or minimum type, order p. 
It is known that if F(z) is of integral order p, then’ 


, log M(r, F) 
(10) lim inf —————— 
r=0 n(r,F — a) 


(9) 


for every a, except possibly a single exceptional value of a. Since F(z) 
and F(z)—a belong to the same type, we deduce from (9) that if 
F(z) is of maximum or minimum type, order p, where p is an integer, then 


(11) 
r=0 m(r,F — a)d(r) 


for every a. If F(z) is of mean type then (11) need not hold for one ex- 
ceptional value of a. For example, ze* and 


are both functions of mean type, order 1. For each of these two func- 


tions 
log M(r, F) 


m = 
r== n(r, F — 0)(log r)*/? 
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7G. Valiron, Lectures on the General Theory of Integral Functions, 1923, p. 86. 
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